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Analytical reagents and mass spectrometry-based methods using these reagents for the rapid, and quantitative analysis of proteins or 
protein function in mixtures of proteins. The methods employ affinity labeled protein reactive reagents having three portions: an affinity 
label (A) covalently linked to a protein reactive group (PRG) through a linker group (L). The linker may be differentially isotopically 
labeled, e.g., by substitution of one or more atoms in the linker with a stable isotope thereof. TTiese reagents allow for the selective isolation 
of peptide fragments or the products of reaction with a given protein (e.g.. products of enzymatic reaction) from complex mixtures. The 
isolated peptide fragments or reaction products are characteristic of the presence of a protein or the presence of a protein function in those 
mixtures. Isolated peptides or reaction products arc characterized by mass spectrometric (MS) techniques. The reagents also provide for 
differential isotopic labeling of the isolated peptides or reaction products which facilitates quantitative determination by mass spectrometry 
of the relative aniount of proteins in different samples. The methods of this invention can be used for qualitative and quantitative analysis of 
global protein expression profiles in cells and tissues, to screen for and identify proteins whose expression level in cells, tissue or biological 
fluids is affected by a stimulus or by a change in condition or cell state of the cell, tissue or organism from which the sample originated. 
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RAPID QUANTITATIVE Analysis OF PROTEINS 
OR PROTEIN FUNCTION IN COMPLEX MIXTURES 

This invention vyas made through funding from the National Science 
Foundation Science and Technology Center for Molecular Biotechnology (grants 
5T32HG and BfR921 4821) and the Nationai Institutes of Health (NIH grants 
RR1 1823, T32HG00035, HD-02274 and QM60184). The United States government 
has certain rights in this invention. 

CROSS-REFERENCE TO RELATED APPLICATIONS 
This application takes priority under 35 U.S.C.§1 1 9(e) from U.S. provisional 
applications serial no. 60/097,788, filed August 25, 1998 and serial no.60/099.1 13 , 
filed Septembers. 1998, both of which are incorporated in their entirety by reference 
herein. 

BACKGROUND OF THE INVENTION 
Genomic technology has advanced to a point at which, in principie, it has 
become possible to determine coriiplete genomic jsequences and to quantitativeiy 
measure the mRNA levels for each gene expressed in a cell. For some species the' 
complete genomic sequence has now been determined, and for 6ne strain of the 
yeast Sacc/7a/Tomyces cervisiae, the mRNA levels for each expressed geiie have 
been precisely quantified under different growth conditions (Veiculeseu et al., 1997). 
Cbniparative cDNA an-ay analysis and related technologies have been used to 
determine induced changes in gene expression at the mRNA level by concurrently 
monitoring the expression level of a large number of genes (in some cases alt the 
genes) expressed by the investigated cell or tissue (Shalon et al., 1996). 
Furthenriore, biological and computational techniques have been used to con-elate 
specific function with gene sequences. The interpretation of the data obtained by 
these techniques in the context of the structui-e. control and mechanism of biological 
systems has been recognized as a considerable challenge. In particular, it has been 
extremely difficult to explain the mechanism of biological processes by genomic 
analysis alone. 
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Proteins are essential for the control arid execution Of virtually every bioto^^^ 
prbcess. The rate of synthesis and the half-life of proteins and thus their expression level 
are also controlled post-transcriptiohally. Fuirthermoi-e. the activity of proteins is frequently 
modulated by post-translational modifications, in particular protein phosphprylation. and 
dependent on the association of the protein with other molebul^s including DMA and 
proteins. Neither the level of expression nor the state of activity of proteins is therefore 
directly apparent frpm the gene sequence or even the expressidn level of ^ 
corresponding mRNA transcript. \t is therefore essential that a complete description of a 
biological system include measurements that indicate the identity; quantity arid the state of 
activity of the proteins which constitute the system. The large-scale (ultimately global) 
analysis of proteins expressed in a cell or tissue has been termed proteome analysis 
(Pennington et al., 1997). 

At present no protein analytical technology approaches the throughput and level of 
automation of genomic technology. The most common implementation of proteom^ 
analysis is based on the separation of complex protein samples most commonly by two- 
dimensional gel electrophoresis (2pE) and the subsequent sequential identification of the 
separated protein species (Ducret etal.. 1998; Gan-els etal.. 1997; Link ^e^^^ 
Shevchenko et al.. 1996; Gygi et al. 1999; Boucherie et al.. 1996). This approach ha§ 
been revolutionized by the development of powerful mass spectrometric techniques and 
the development of computer algorithms which correlate protein and peptide mass spectral 
data with sequence databases and thus rapidly and conclusively Identify proteins (Eng et 
al.. 1 994; Mann and Wilm. 1 994; Yates et al., 1 995). This technology has reached a level of 
sensitivity which now permits the identification of essentially any proteih which is detectable 
by conventional protein staining methods including silver staining (Figeys and Aebersbld. 
1998; Figeys etal.. 1996; Figeys et al.. 1997: Shevchenko etaL, 1996). However, the 
sequential manner In which samples are processed limits the sample throughput, the most 
sensitive methods have been difficult to automate and low abundance proteins, such as 
regulatory proteins, escape detection without.prior enrichment, thus effectively limiting the 
dynamic range of the technique. In the 2DE/(MS)N method, proteins are quantified by 
densitometry of stained spots in the 2DE gels. 
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I The development of methods and instrumentation for automated, data-dependent 
electrospray ionization (ESI) tandem mass spectrometry (MS") in conjunction with 
microcapillary liquid chromatography (pLC) and database searching has significantly 
increased the sensitivity and speed of the identification of gel-separated proteins: As an 
alternative to the 2PE/MS" approach to proteome analysis, the direct analysis by tandem 
mass spectrometry of peptide mixtures generated by the digestion of complex protein - 
mixtures has been proposed (Dongr-e et al., 1997). pLC-Ms/MS has also been used 
successfully for the large-scale identification of individual proteins directly from mixtures 
without gel electrophoretic separation (Linl< etal.. 1999; Opitek et al., 1997) While these 
approaches dramatically accelerate protein identification, the quantities of the analyzed 
proteins cannot be easily determined, and these methods have not been shown to 
substantially alleviate the dynamic range problem also encountered by the 2DeMS/MS 
approach. Therefore, low abundance proteins in complex samples are also difficult to 
analyze by the uLC/MS/MS method without their prior enrichment. ' 

It is therefore apparent that cun-ent technologies, while suitable to identify the 
components of protein mixtures, are neither capable of measuring the quantity nor the state 
of activity of the protein in a mixture. Even evolutionary improvements of the current 
approaches are unlikely to advance their perfomiance sufficiently to make routine 
quantitative and functional proteome analysis a reality. ^ 

This invention provides methods and reagents that can be employed in proteome 
analysis which overcome the limitations inherent in traditional techniques. The basic 
approach described can be employed for the quantitative analysis of protein expression in 
complex samples (such as cells, tissues, and fractions thereof), the detection and 
quantitation of specific proteins in complex samples, and the quantitative measurement of 
specific enzymatic activities in complex samples. 

In this regard, a multitude of analytical techniques are presently available for clinical 
and diagnostic assays which detect the presence, absence, deficiency or excess of a 
protein or protein function associable with a normal or disease state. While these 
techniques are quite sensitive, they do not necessarily provide chemical speciation of 
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products and may. as a result, be difficult to use for assaying several proteins or enzymes 
simultaneously in a single sample. Current methods may not distinguish among aberrant 
expression of different enzymes or their malfunctions which lead to a common set of clinical 
symptoms. The methods and reagents herein can be employed in clinical and diagnostic 
assays for simultaneous (multiplex) monitoring of multiple proteins and protein reactions. 

SUMMARY OF THE INVENTION 
This invention provides analytical reagents and mass spectrometry-based methods 
using these reagents for the rapid, and quantitative analysis of proteins or protein function 
in mixtures of proteins. The analytical method can be used for qualitative and particularly 
for quantitative analysis of global protein expression profiles in cells and tissues. I.e. the 
quantitative analysis of proteomes. The method can also be employed to screen for and 
identify proteins whose expression level in tells, tissue or biological fluids is affected by a 
stimulus (e.g;, administration of a dmg or contact with a potentially toxic material), by a ' 
change in environment (e.g.. nutrient level, temperature, passage of time) or by a change 
in condition or cell state (e.g.. -disease state, malignancy, site-directed mutation, gene 
knockouts) of the cell, tissue or organism from which the sample originated. The proteins 
identified in such a screen can function as markers for the changed state. For example, 
comparisons of protein expression profiles of normal and malignant cells can result in the 
Identification of proteins whose presence or absence is bharacteristic and diagnostic of the 
malignancy. 

In an exemplary embodiment, the methods herein can be employed to screen for 
changes in the expression or state of enzymatic activity of specific prbteins. These 
changes may be induced by a variety of chemicals, including pharmaceutical agonists or 
antagonists, or potentially harmful or toxic materials. The knowledge of such changes may 
be useful for diagnosing enzyme-based diseases and for investigating complex regulatory 
networks in cells. 

The methods herein can also be used to implement a variety of clinical and 
diagnostic analyses to detect the presence, absence, deficiency or excess of a given 
protein or protein function in a biological fluid (e.g.. blood), or in cells or tissued The 
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n!iethod is particularly useful in the analysis of complex mixtures of proteins; i.e.. those 
containing 5 or more distinct proteins or protein functions. 

The inventive method employs affinity-labeled protein reactive reagents that allow 
for the selective isolation of peptide fragmferits or the products of reaction with a given 
protein (e.g.. products of enzymatic reaction) from complex mixtures. The isolated peptide 
fragments or reaction products are characteristic of the presence of a protein or the 
presence of a protein function, e.g.. an enzymatic activity, respectively, in those mixtures. 
Isolated peptides or reaction products are characterized by mass spectrom6tric (WiS) 
techniques. In particular, the sequence of isolated peptides can b6 detemiined using 
tandem MS (MS") techniques, and by application of sequence database searching 
techniques, the protein from which the sequenced peptide originated can be identified. 
The reagents also provide for differential isotopic labeling of the isolated peptides or 
reaction products which facilitates quantitative determination by mass spectrometry of the 
relative amounts of proteins in different samples. Also, the use of differentially isotopically- 
labeled reagents as intemal standards facilitates quantitative determination of the absolute 
amounts of one or more proteins or reaction products present in the sample. 

In general, the affinity labeled protein reactive reagents of this inventibn have three 
portions: an affinity label (A) covalently linked to a protein reactive group (PRG) through a 
linker group (L): 

A-L-PRG 

The linker may be differentially isotopically labeled, e.g.. by substitution of one or more 
atoms in the linker with a stable isotope thereof. For example, hydrogens can be 
substituted with deuteriunris or with C'^ 

The affinity label A functions as a molecular handle that selectively binds covalently 
or non-covalently. to a capture reagent (CR). Binding to CR facilitates isolation of 
peptides, substrates or reaction products tagged or labeled with a: In specific 
embodiments, A is a strepavidin or avidinn. After affinity isolation of affinity tagged 
materials, some of which may be isotopically labeled, the interaction between A and the 
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capture reagent is disrupted or broken to allow MS analysis of the isolated materials! The 
affinity label may be displaced from the capture reagent by addition of displacing ligand. 
which may be free A or a derivative of A. or by changing solvent (e.g.. solvent type or pH) 
or temperature conditions or the linker may be cleaved chemically, fenzymatically, thermally 
or photochemically to release the isolated materials for MS analysis. 

Two types of PRG groups are specifically provided herein: (a) those groups that 
selectively react with a protein functional group to form a covalent or non-covalent bond 
tagging the protein at specific sites, and (b) those that are transformed by action of the 
protein, e.g., that are substrates for an enzyme. In specific embodiments. PRG is a group 
having specific reactivity for certain protein groups, such as specificity for sulfhydryl groups, 
and is useful in general for selectively tagging proteins in complex mixtures. A sulfhydryl 
specific reagent tags proteins containing cysteine. In other specific embodiments. PRG is 
an enzyme substrate that is selectively cleaved (leaving A-L) or modified (giving A-L-PRG') 
by the action of an enzyrne of interest. 

Exemplary reagents have the general formula: 

A-B^.X^(CH2)„-[XMCH2)„ ],.XMCH2)p -X^-B^^PRG 

where: 

A Is the affinity label; 

PRG is the protein reactive group; 

X\ X^. and X^ independently of pne another, and X^ independently of other X^ in 
the linker group, can be selected from O. S. NH. NR. NRR'*, CO; COO. COS, S-S. 
SO. SO2. CO-NR'. CS-NR\ Si-O. aryl or diaryl groups or X'-X^ may be absent, but 
preferably at least one of X^ -X" is present: 

and B^ independently of one another, are optional moieties that can faciliate 
bonding of the A or PRG group to the linker or prevent undesired cleavage of those 
groups from the linker and can be selected, for example, from COO. CO. CO-NR'. 
CS-NR- and may contain one or more CH^ groups alone or in combination with other 
groups, e.g.( CH2 VCONR', (CH2VCS-NR'. or (CH2),; 
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I n, m. p and q are whole numbers that can have values from 0 to about 100, 

preferably one of n. m, p or q is not 0 and x is also a whole number that can range 
from 0 to about 100 where the sum of n+xm+p+q is preferably less than about 100 
and more preferably less than about 20; 
R is an alkyi, alkenyl, alkynyl. alkoxy or aryi group; and 

R' is a hydrogen, an alkyi, alkenyl, alkynyl. alkoxy or aryl group. 

One or more of the CHj groups of the linker can be optionally substituted with small 
(C1-C6) alkyI, alkenyl. or alkoxy groups, an aryl group or can be substituted with functional 
groups that promote ionization, such as acidic or basic groups or groups carrying 
pemianent positive or negative charge. One or more single bonds connecting CH^ groups 
in the linker can be replaced with a double or a triple bond. Prefen-ed R and R' alkyl. 
alkenyl. alkynyl or alkoxy groups are small having 1 to about 6 carbon atoms. 

One or more of the atoms in the linker can be substituted with a stable Isotdpe to 

generate one or more substantially chemically identical, but isotopically distinguishable 
reagents. For example, one or more hydrogens in the linker can be substituted with 
deuterium to generate isotopically heavy reagents. 

In an exemplary embodiment the linker contains groups that can be cleaved to 
remove the affinity tag. If a cleavable linker group is employed, it is typically cleaved after 
affinity tagged peptides, substrates or reaction products have beeii isolated using the 
affinity label together with the CR. In this case, any isotopic labeling in the linker preferably 
remains bound to the protein, peptide, substrate or reaction product. 

Linker groups include among others: ethers, polyethers. ether diamines, polyether 
diamines, diamines, amides, polyamides. polythioethers. disulfides, silyl ethers, alkyl or 
alkenyl chains (straight chain or branched and portions of which may be cyclic), aryl, diaryl 
or alkyl-aryl groups. Aryl groups in linkers can contain one or more hetero^toms (e.g,. N. O 
or S atoms). 
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Jn one aspect, the invention provides a mass spectronnetric method for identification 
and quantitation of one or more proteins in a complex mixture whicii employs affinity 
labeled reagents in which the PRG is a group that selectively reacts with certain groups 
that are typically found in peptides (e.g..sulfhydryl. amino, carboxy. homoserine lactone 
groups). One or more affinity labeled reagents with different PRG groups ar^ introduced 
into a mixture containing proteins and the reagents react with certain pfotbins to t^g them 
with the affinity label. It may be necessary to pretreat the protein mixture to reduce 
disulfide bonds or otherwise facilitate affinity labeling. After reaction with the affinity labeled 
reagents, proteins in the complex mixture are cleaved, e.g., eirizymatically; into a number 
of peptides. This digestion step may not be necessary, if the proteins are relatively small. 
Peptides that remain tagged with the affinity label are isolated by ^h affinity isolation 
method, e.g., affinity chromatography, yia their selective binding to the CR. Isolated 
peptides are released from the CR by displacehnent of A or cleavag^ of the linker, and 
released materials are analyzed by liquid chromatography/mass spectrometry (LC/MS). 
The sequence of one or more tagged peptides is then determined by MS" techniques. At 
least one peptide sequence derived from a protein will be characteristic of that protein and 
be indicative of Its presence in the mixture. Thus, the sequences of the peptides typically 
provide sufficient infomnation to identify one or more proteins present in a mixture^ 

Quantitative relative amounts of proteins in one or more different sarnples containing 
protein mixtures (e.g., biological fluids, cell or tissue lysates, etc.) can be determined using 
chemically identical, affinity tagged and differentially isotopically labeled reagents to affinity 
tag and differentially isotopically label proteins in the different samples. In this method, each 
sample to be compared is treated with a different isotopically labeled reagent to tag certain 
proteins therein with the affinity label: The treated samples are then combined, preferably in 
equal amounts, and the proteins in the combined sample are en^ymatically digested, if 
necessary, to generate peptides. Some of the peptides are affinity tagged and in addition 
tagged peptides orlginatihg from different samples are differentially isotopically labeled. As 
described above, affinity labeled peptides are isolated, released from the capture reagent 
and analyzed by (LC/MS). Peptides characteristic of their protein origin are sequenced using 
MS" techniques allowing identification of proteins in the samples The relative amounts of a 
given protein in each sample is determined by comparing relative abundance of the ions 
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generated from any differentially labeled peptides originating from that protein. The method 
can be used to assess relative amounts of known proteins in different samples. Further, 
since the method does not require any prior knowledge of the type of proteins that may be 
present in the samples, it can be used to identify proteins which are present at different 
levels in the samples examined. More specifically, the method can be applied to screen for 
and identify proteins which exhibit differential express in cells, tissue or biological fluids. It is 
also possible to determine the absolute amounts of specific proteins in a complex mixture: In 
this case, a known amount of internal standard, one for each specific protein in the mixture to 
be quantified, is added to the sample to be analyzed. The intemal standard is an affinity 
tagged peptide that js identical in chemical structure to the affinity tagged peptide to be 
quantified except that the intemal standard is differentially isotopically labeled, either in the 
peptide or In the affinity tag portion. tP distinguish it from the affinity tagged peptide to be 
quantified. The intemal standard can be provided in the sample to be analyzed in other 
ways. For example, a specific protein or set of proteins can be chemically tagged with an 
isotopically-labeled affinity tagging reagent. A known amount of this material can be added 
to the sample to be analyzed. Alternatively, a specific protein or set of proteins may be 
labeled with heavy atom isotopes and theh derivatized with an affinity tagging reagent. 

Also, it is possible ot quantify the levels of specific proteins in multiple samples in a 
single analysis (multiplexing). In this case, affinity tagging reagent s used to derivatize " 
proteins present in different affinity tagged peptides from different samples can be selectively 
quantified by mass spectrometry. 

In this aspect of the invention, the method provides for quantitative measurement of 
specific proteins in biological fluids, cells or tissues and can be applied to determine global 
protein expression profiles in different cells and tissues- The same general strategy can be 
broadened to achieve the proteome-wide. qualitative and quantitative analysis of the state of 
modification of proteins, by employing affinity reagents with differing specificity for reaction 
with proteins. The method and reagents of this invention can be used to identify low 
abundance proteins in complex mixtures and can be used to selectively analyze specific 
groups or classes of proteins such as membrane or cell surface proteins, or proteins 
contained within organelles, sub-cellular fractions, or biochemical fractions such as 
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immunoprecipitates. Further, these methods can be applied to analyze differences^ln 
expressed proteins in different cell states. For example, the methods and reagents herein 
can be employed in diagnostic assays for the detection of the presence or the absence of 
one or more proteins Indicative of a disease state, such as cancer. 

In a second aspect, the invention provides a MS method for detection of the presence 
or absence of a protein function, e.g.; an enzyme activity, in a sample, the method can also 
be employed to detect a deficiency or excess (over nomiai levels) of pfoteln function in a 
sample. Samples that can be analyzed include various biological fluids and materials, 
including tissue and cells. In this case, the PRG of the affinity labeled reagent is a substrate 
for th^ enzyme of interest. Affinity labeled substrates are provided for each ehzyme of 
interest and are introduced into a sample where they react to generate affinity labelW 
products, if the enzyme of interest is present in the sample. Products or unreacted substrate 
that are tagged with the affinity label are isolated by an affinity isblatlon method, e.g.. affinity 
chromatography, via their selective binding to the CR. The isolated tagged substrates and 
products are analyzed by mass spectrometry. Affinity labeled products include those in 
which the substrate is entirely cleaved from the linker or in which the substrate is modified by 
reaction with a protein of interest. Detection of the affinity-labeled product indicates the 
protein function is present in the sample. Detection of little or no affinity labeled product 
indicates deficiency or absence, respectively, of the protein function in the sample. 

The amount of selected protein, e.g., measured in temis of enzyhi^ aictivity, present in 
a sample can be measured by introducing a known amount of an internal standard which Is 
an isotopically labeled analog of the expected product of the enzymatic reaction of the 
reagent substrate. The internal standard Is substantially chemically identical to the expected 
enzymatic reaction product, but is isotopically distinguishable therefrom: The tevel of protein 
function (e.g., enzymatic activity) in a given sample can be compared with activity levels in 
other samples or controls (either negative or positive controls). The procedure therefore can 
detect the presence, absence, deficiency or excess of a protein function in a sample. The 
method is capable of quantifying the velocity of an enzymatic reaction since it nables the 
amount of product formed over a known time period to be measured. This method can be 
multiplexed, by simultaneous use of a plurality of affinity labeled substrates selective for 
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different protein functions and if quantitation is desired by inclusion of the corresponding 
internal standards for expected products, to analyze for a plurality of protein functions in a . 
single sarriple. . ■ 

, DETAILED DESCRIPTION OF THE INVENTION 

The methods of this invention employ affinity tagged protein reactive reagents in 
which the affinity tag is covalently attached to a protein reactive group by a linker. The linker 
can be isotopically labeled to generate pairs or sets of reagents that are substantially 
chenriically identical, but which are distinguishable by rnass. For example a pair of reagents, 
one of which is i^ptopically heavy and the other of which is isotopically light can be employed 
for the comparison of two samples one of which may be a reference sample containing one 
or more known proteins in known amounts. For example, any one or more of the hydrogen, 
nitrogen, oxygen or sulfur atoms in the linker may be replaced with their isotopically stable 
isotopes: ^H, "G, ^H"0,^'^p or **S. 

Suitable affinity tags bind selectively either covalently or non-covaleritly and with high 
affinity to a capture reagent (CR). The CR-A interaction or bond should remain intact after 
extensive and multiple washings with a variety of solutions to remove hon-Specifically bound 
components. The affinity tag binds minimally or preferably not at all to components in the 
assay system, except CR, and does not siginficantiy bind to surfaces of reaction vessel^. 
Any non-specific interaction of the affinity tag with other components or surfaces should be 
disrupted by multiple washes that leave CR-A intact. Further, it must be possible to dismpt 
the interaction of A and CR to release peptides, substrates or reaction products, for example, 
by addition of a displacing ligand or by changing the temperature or solvent conditions. 
Preferably, neither CR or A react chemically with other components in the assay system and 
both groups should be chemically stable over the time period of an assay or experiment. 
The affinity tag preferably does not undergo peptide-like fragmentation during (MS)" analysis. 
The affinity label is preferably soluble in the sample liquid to be analyzed and the CR should 
remain soluble in the sample liquid even though attached to an insoluble resin such as 
Agarose. In the case of CR term soluble means that CR is sufficiently hydrated or otherwise 
solvated such that it functions property for binding to A. CR or CR-containing conjugates 
should not be present in the sample to be analyzed, except when added to capture A. 
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^ Examples of A and CR pairs include: 

d-biotin or structurally modified biotin-based reagents, including d-iminobiotin, whidh 
bind to proteins of the avidin/streptavidin, which may. for example, be used in the 
forms of strepavidin-Agarose. oligomeric-avidin-Agarose, 6r monomeric-avidin- 
Agarose; 

any 1,2-diol. such as 1,2-dihydroxyethane (HO-GHa-CHj-dH). and bthe 
dihyroxyalkanes Including those of cyclic alkanes. e#n.2^dihydroxycycl6h^ 
which bind to an alkyi or aryl boronic acid or boronic acid esters /such as phenyl- 
B(OH)2 or hexyl-B(OEthyl)2 which may be attached via the alkyI or aryl group to a solid 
support material, such as Agarose; 

maltose which binds to maltose binding protein (as well as any other sugar/sugar 
binding protein pair or more generally to any ligand/ligand binding protein pairs that 
has properties discussed above); 

a hapten, such as dinltrophenyl group, for any antibody where the hapten binds to ah 
anti-hapten antibody that recognizes the hapten, for example the dinitrophenyl group 
wiH bind to an anti-dlnitrophenyl-IgG; 

a ligand which binds to a transition metal, for example, ah oligomeric histidine will bind 
to Ni(ll). the transition metal CR may be used in the form of a resin bound chelated 
transition metal, such as nitrilotriacetic acid-chelated Ni(ll) or Iminodiacetic acid- 
chelated Ni(ll): 

glutathione which binds to glutathione-S-transferase. 

In general, any A-CR pair commonly used for affinity enrichment which meets the 
suitability criteria discussed above. Biotin and biotin-based affinity tags are prefen-ed. Of 
particular interest are structurally modified biotins, such as d-iminoblotin. which will elute from 
avidin or strepavidin columns under solvent conditions compatible with ESI-MS analysis. 
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such as dilute acjds containing 10-20% organic solvent. It is expected that d-iminobiotih 
tagged compounds will elute in solvents below pH 4. d-lminobiotin tagged protein reactive 
reagents can be synthesized by methods described herein for the corresponding biotin 
tagged reagents. 

A displacement ligand, DL, is optionally used to displace A from CR. Suitable DLs are 
not typically pi-esent in samples unless added. DL should be chemically and enzymatically 
stable in the sample to be analyzed and should not react with or bind to components (other 
than CR) in samples or bind rion-specifically to reaction vessel walls. DL preferably does 
not undergo peptide-like fragmentation during MS analysis, and its presence in sample 
should not significantly suppress the ionization of tagged peptide, substrate or reaction 
product conjugates. 

DL itself preferably is nriinimally ionized during mass spectrometric analysis and the 
fonmation of ions composed of DL clusters is preferably minimal. The selection of DL, 
depends upon the A and CR groups that are employed. In general, DL is selected to 
displace A from CR in a reasonable time scale, at most within a week of its addition, but 
more preferably within a few minutes or up to an hour. The affinity of DL for CR should be 
comparable or stronger than the affinity of the tagged compounds containing A for CR. 
Furthenmore, DL should be soluble in the solvent used during the elution of tagged 
compounds containing A from CR. DL preferably is free A or a derivative or structural 
modification of A. Examples of DL include, d-biotin or d-biotin derivatives, particulariy those 
containing groups that suppress cluster formation or suppress ionization in MS. 

The linker group (L) should be soluble in the sample liquid to be analyzed and it^ 
should be stable with respect to chemical reaction, e.g., substantially chemically inert, with 
components of the sample as well as A and CR groups. The linker when bound to A should 
not interfere with the specific interaction of A with CR or interfere with the displacement of A 
from CR by a displacing ligand or by a change in temperature or solvent The linker should 
bind minimally or preferably not at all to other components in the system, to reaction vessel 
surfaces or CR. Any non-specific interactions of the linker should be broken after multiple 
washes which leave the A-CR complex intact. Linkers preferably do not undergo peptide-like 
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fragmentation during (MS)" analysis. At least some of the atoms in the linker groiips should 
be readily replaceable with stable heavy-atom isotopes: The linker preferably contains 
groups or moieties that facilitate ionization of the affinity tagged reagents, peptides, 
substrates or reaction products. 

To promote ionization, the linker may contain acidic or basic groups, e.g., COOH. 
SO3H, primary, secondary or tertiary amino groups, nitrogen-heterocycles, ethers, or 
combinations of these groups. The linker may also contairi groups having a pemiahent 
charge, e.g., phosphonium groups, quaternary ammonium groups, sulfonium groups, 
chelated metal ions, tetralky or tetraryl borate or stable carbanions. 

The covalent bond of the linker to A or PRG should typically not be uniritentionally 
cleaved by chemical or enzymatic reactions during the assay. In some cases it may be 
desirable to cleave the linker from the affinity tag A or from the PRG, for example to facilitate 
release from an affinity column. Thus, the linker can be cleavable. for example, by chemical, 
thennal or photochemical reaction. Photocleavable groups in the linker may include the 1 -(2- 
nitrophenyOethyl group. Thermally labile linkers may. for example, be a double-stranded 
duplex formed from two complementary strands of nucleic acid, a strand of a nucleic acid 

with a complementary strand of a peptide nucelic acid, or two complementary peptid 
nucelic acid strands which will dissociate upon heating. Cleavable linkers also include those 
having disulfide bonds, acid or base labile groups, including among others, diarylmethyl or 
trimethylarylmethyl groups, silyl ethers, carbamates, oxyesters. thiesters. thionbesters. and 
a-fluorinated amides and esters. Enzymatically cleavable linkers can contain, for example, 
protease^sensitive amides or esters. P-lactamase-sensitive 3-lactam analogs and linkers that 
are nuclease-cleavable. or glycosidase-cleavable. 

The protein reactive group (PRG) can be a group that selectively reacts with certain 
protein functional groups or is a substrate of an enzyme of interest. Any selectively reactive 
protein reactive group should react with a functional group of interest that is present in at 
least a portion of the proteins in a sample. Reaction of PRG with functional. groups on the 
protein should occur under conditions that do not lead to substantial degradation of the 
compounds in the sample to be analyzed. Examples of selectively reactive PRGs suitable 
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for use in the affinity tagged reagents of this invention, include those which react with 
sulfhydryl groups to tag proteins containing cysteine, those that react with amino groups, 
carboxylate groups, ester groups, phosphate reactive groups, and aldehyde and/or ketone 
reactive groups or. after fragmentation with CNBr. with homoserine lactone. 

Thiol reactive groups include epoxides, a-haloacyl group, nitrites, sulfonated alkyi or 
aryl thiols and maleimides. Amino reactive groups tag amino groups in proteins and include 

sulfonyl halides. isocyanates. isothiocyanantes. active esters, indudihg tetrafluorophen^^ 
esters, and N.hydroxysuccinimidyl esters, acid halides. and acid anyhydrides. In addition, 
amino reactive groups include aldehydes or ketones in the presence or absence of NaBH^ or 
NaCNBHa. 

Carboxylic acid reactive groups include amines or alcohols in the presence of a 
cpupling agent such as dicyclohexylcarbodiimide, or 2.3.5.6-tetrafluorophenyl trifluorbacetate 
and in the presence or absence of a coupling catalyst such as 4^dimethylamin6pyridine; and 
transition metaWiamine complexes including Cu(ll)phenanthroii^^ 

fester reactive groups include amines which, for example, react with homoserine 
lactone. 

Phosphate reactive groups include chelated metal where the metal is, for example 
Fe(IH) or Ga(lll), chelated to. for example, hitrilotriacetiac acid or iminodiacetic acid. 

^ Aldehyde or ketone reactive groups include amine plus NaBH^ or NaCNBHa. or these 
reagents after first treating a cart)ohydrate with periodate to generate an aldehyde or ketone. 

PRG groups can also be substrates for a selected enzyme of interest. The enzyme of 
interest may. for example, be one that is associated with a disease state or birth defect or 
one that is routinely assayed for medical purposes. Enzyme substrates of interest for use 
with the methods of this invention include, acid phosphatase, alkaline phosphatase, alanine 
aminotransferase, amylase, angiotensin converting enzyme, aspartate aminotransferase, 
creatine kinase, gamma-glutamyltransferase. lipase, lactate dehydrogenase, and 
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glucose-6-phosphate dehydrogenase which are currently routinely assayed by other 
methods. 

The requirements discussed above for A. L. PBG. extend to the con-esporidirig to the 
segments of A-L-PRG and the reaction products generated with this reagent. 

Internal standards, which are appropriately isotoplcally labelled, may be employed In 
the methods of this invention to measure absolute quantitative amounts of proteins In 
samples. Internal standards are of particular use in assays intended to qti^ntitke affinity 
tagged products of enzymatic reactions. In this application, the internal stahdard is 
chemically identical to the tagged enzymatic product generated by the action of the enzynie 
on the affinity tagged enzyme substrate, but carries isotope labels which may include ^H, "C, 
'^N, ''^O, '^O, or ^S, that allow it to be independently detected by MS techniques. Internal 
standards for use In method herein to quantitative one dr several proteing in a sampfe are 
prepared by reaction of affinity labeled protein reactive reagents with a knbwn protein to 
generate the affinity tagged peptides generated from digestion of the tagged protein. Affiftity 
tagged peptides internal standards are substantially chemically Identical to the corresponding 
affinity tagged peptides generated from digestion of affinity tagged protein, except that they 
are differentially isotoplcally labeled to allow their independent detection by MS techniques; 

The method of this invention can also be applied to determine the relative quantities of 
one or more proteins in two or more protein samples, the proteins in each sartiple are 
reacted with affinity tagging reagents which are substantially chemically identical but 
differentially isotoplcally labeled. The samples are combined and processed as one. The 
relative quantity of each taggg^d peptide which reflects the relative quantity of the protein 
from which the peptide originates is determined by the measurement of the respective 
isotope peaks by mass spectrometry. 

The methods of this invention can be applied to the analysis or comparison of multiple 
different samples. Samples that can be analyzed by methods of this invention include cell 
homogenates; cell fractions; biological fluids including urine, blood, and cerebrospinal fluid; 
tissue homogenates; tears; feces; saliva; lavage fluids such as lung or peritoneal lavages; 



16 



wo 00/11208 



PCT/US99/19415 



mixtures of biological molecules including proteins, lipids, carbohydrates and nucleic 
acids generated by partial or complete fractionation of cell or tissue homogenates. 

The methods of this invention employ MS and (MS)" methods. While a variety of MS 
and (M^)" are availabel and may be used in these methods, Matrix Assisted Laser 
Desorption Ionization MS (MALDI/MS) and EI6ctrospray Ionization MS (ESI/MS) methods 
are preferred. 

Quantitative proteome analysis 

This method is schematically illustrated in Scheme 1 using a biotin labeled sulfhydryl-reactive 
reagent for quantitative protein profile measurements in a sample protein mixture and a 
reference protein mixture. The method comprises the following steps: 

Reduction. Disulfide bonds of proteins in the sample and reference mixtures are ' 
reduced to free SH groups. The prefen-ed reducing agent is tri^h-butylphosphine which is 
used under standard conditions. Alternative reducing agents include mercaptoethylamihe 
and dithiothreitol. If required, this reaction can be perfonned in the presence of solubilizing 
agents including high concentrations of urea and detergents to maintain protein solubility. 
The reference and sample protein mixtures to be compared are processed separately, 
applying identical reaction conditions; 

^ Derivatization of SH groups with an affinity tag. Free SH groups are derivatized with 
the biptinylating reagent biotinyl-iodoacetylamidyl-4.7,10 trioxatridecanedia mine the 
synthesis of which is described below. The reagent is prepared in different isotopically 
labeled fonns by substitution of linker atoms with stable isotopes and each sample is 
derivatized with a different isotopically labeled form of the reagent. Derivatization of SH 
groups is preferably performed under slightly basic conditions (pH 8.5) for 90 min at RT. For 
the quantitative, comparative analysis of two samples, one sample each (temied reference 
sample and sample) are derivatized as illustrated in Scheriie 1 with the isotopically light and 
the isotopically heavy fonn of the reagent, respectively. For the comparative analysis of 
several samples one sample is designated a reference to which the other samples are 
related to. Typically, the reference sample is labeled with' the isotopically heavy reagent and 
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the experimental samples are labeled with the isotopically light fomi of the reagent, althbugh 
thi^ choice of reagents is arbitrary. These reactions are also compatible' with the presence 6f 
high concentrations of solubilizing agents; 

Combination of labeled samples. After completion of the affinity tagging reaction 
defined aliquots of the samples labeled with the isotopically differentleagents (e.g., heavy 
arid light reagents) are combined and ail the subsequent steps are perfomied on the pooled 
samples. Combination of the differentially labeled samples at this eariy stage of the 
procedure eliminates variability due to subsequent reactions and manipulations. Preferably 
equal amounts of each sample are combined; - 

Removal of excess affinity tagged reagent. Excess reagent is adsorbed; for eximpleV 
by adding an excess of SH-containing beads to the reaction mixture after protein SH groups 
are completely derivatized. Beads are added to the solution to achieve about a 5-fbld molar 
excess of SH groups over the reagent added and incubated for 30 min at RT. After the 
reaction the beads are be removed by centrifugation; 

Protein digestion. The proteins in the sample mixture are digested, typically with ! 
trypsin. Alternative proteases are also compatible with the procedure as in fact are ch^itiicai 
fragmentation procedures. In cases in which the preceding steps werd perfomied in the 
presence of high concentrations of denaturing solubilizing agents the sample mixture are 
diluted until the denaturant concentration is compatible with the activity of the proteases 
used. This step may be omitting in the analysis of small proteins: 

Affinity Isolation of the affinity tagged peptides by interaction with a capture reagent. 
The biotlnylated peptides are isolated on avidin-agarose. After digestion the pH of the 
peptide samples is lowered to 6.5 and the biotinylated peptides are immobilized on bfeads 
coated with monomeric avidin (Pierce). The beads are extensively washed. The last washing 
solvent includes 10% methanol to remove residual SDS. Biotlnylated peptides are eluted 
from avidin-agarose, for example, with 0.3% forniic acid at pH 2; 
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Analysis Of the isolated, derivatized peptides by MLC-MS"or CE-MS" with data 
dependent fragmentation. Methods and instmrneht control protocols well-known in the art 
and described, for example, in Ducret et al.. 1998; Figeys and Aebersold, 1998; Figeys et al.. 
1Q96; or Haynes et al., 1998 are used: 

In this last step, both the quantity and sequence identity of the proteins from which the 
tagged peptides originated can be determined by automated multistage MS. This is 
achieved by the operation of the mass spectrometer in a dual mode in which it alternates In 
successive scans between measuring the relative quantities of pe(3tides 6luting from the 
capillary column and recording the sequence infonnation of selected peptides. Peptides are 
quantified by measuring in the MS mode the relative signal intensities for pairs of peptide 
ions of identical sequence that are tagged with the isotopically light or heavy forms of the 
reagent, respectively, and which therefore differ in mass by the mass differential encoded 
within the affinity tagged -reagent. Peptide sequence information is automatically generated 
by selecting peptide Ions of a particular mass-to-charge (m/z) ratio for colHsion-induc^d 
dissociation (GID) in the mass spectrometer operating in the MS" mode. (Link. A.J. et al. 
(1997). Gygl. S.P. et al. (1999). and Gygi, S.P. et al. (1999)). The resulting CI D spectra are 
then automatically con-elated with sequence databases to identify the protein from which the 
sequenced peptide originated. Combination of the results generated by MS and MS" 
analyses of affinity tagged and differentially labeled peptide samples therefore detenrjihes 
the relative quantities as well as the sequence identities of the components of protein 
mixtures in a single, automated operation. 

Results of this applying this method using the biotinylated sulfhydryf reagent and to 
the quantitative analysis of synthetic peptide samples, to the relative quantitation of the 
peptides in a protein digest an the tandem mass spectral analysis of a derivatized peptide 
are shown in Fig. 1, Table L and Fig. 2. respectively. 

This method can also be practiced using other affinity tags and other protein reactive 
groups, including amino reactive groups, carboxyl reactive groups, or groups that react with 
homoserine lactones. 
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The approach employed herein for quanitative proteome analysis is based on two 
principles. First, a short sequence of contiguous amino acids from a protein (5-25 residues) 
contains sufficient information to uniquely identify that protein. Protein identification by MS" 
is accomplished by correlating the sequence information contained in the CI D mass 
spectrum with sequence databases, using sophisticated computer searching algorithms 
(Eng. J. et al. (1994); Mann. M. et al. (1994); Qin. J. et al. (1997); Clauser. K.R et al. (1995)). 
Second, pairs of identical peptides tagged with the light and heavy affinity tagged reagents, 
respectively, (or in analysis of more than two samples, sets of ideritical tagged peptides in 
which each set member is differentially isotopically labeled) are chemically identical and 
therefore serve as mutual internal standards for accurate quantitation. The MS 

measurement readily differentiates between peptides originating from different samples 
representing for example different cell states, because of the difference between isbtoplcally 
distinct reagents attached to the peptides. The ratios between the intensities of the differing 

weight components of these pairs or sets of peaks provide an accurate measure of the 
relativeabundance of the peptides (and hence the proteins) in the original cell pools IJetause 
the MS intensity response to a given peptide Is independent of the isotopic composition of 
the reagents (De Leenheer. A.P. et al (1992). The use of isotopically labeled internal 
standards is standard practice in quantitative mass spectrometry and has been exploited to 
great advantage in. for example, the precise quantitation of drugs arid metabolites in bodily 
fluids (De Leenheer. A.P. et al. (1992). 

In another illustration of the method, two mixtures cohsisling of the same six proteins 
at known, but different, concentrations were prepared and analyzed. The protein mixtures 
were labeled, combined and treated as schematically illustrated in Scheme 1. The isolated, 
tagged peptides were quantified and sequenced in a single combined pLC-MS and pLC-MS" 
experiment on an ESI ion trap mass spectrometer. All six proteins were unambiguously 
identified and accurately quantified (Table 2). Multiple tagged peptides were encountered for 
each protein. The differences between the obseived and expected quantities for the six 
proteins ranged between 2 and 12%. 

The process is further illustrated for a single peptide pair in Figs. 3A-C. A single scan 
of the mass spectrometer operated in MS mode is shown in Fig. 3A. Four pairs of peptide 
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ions Characterized by the mass differential encoded in the affinity tagged reagent are 
detected in this scan and indicated with their respective m/z values. The scan shown was 
acquired in 1.3 s. Over the course of the one-hour chromatographic elution gradient, more 
than 1200 such scans were automatically recorded. Fig. 3B shows an expanded view of the 
mass spectrum around the ion pair with m/z ratios of 9i93.8 and 977.7, respectively. Co- 
elution and a detected mass differential of four units potentially identifies thd ions as a pair of 
doubly charged affinity tagged peptides of identical sequence (mass difference of eight and a 
charge state of two). Fig. 3C shows the reconstmcted ion chrbmatograms for thes^ two 
species. The relative quantities were determined by integrating the contour of the respective 
peaks. The ratio (light/heavy) was detentiined as 0.54 (Tablet). The peaks in the 
reconstructed ion chromatograms appear serrated because in every second scan the rhass 
spectrometer switched between the MS and the MS" modes to collect sequence ihfomiation 
(CID mass spectrum) of a selected peptide ion. These CID spectra were used to Identify the 
protein from which the tagged peptides originated. Fig. 4A shows the CID spectrum 
recorded from the peptide ion with m/z = 998 (marked with an arrow in Fig. 3A). Database 

searching with this CID spectrum identified the protein as glyceraldehyde-3-phosphate 
dehydrogenase (Fig. 4B) which was a member of the protein mixture. 

Several beneficial features of the this method are apparent, hrst. at least two 
peptides were detected from each protein in the mixture, therefore, both quantitatidh and 
protein Identification can be redundant. Second, the identified peptides all contained at le^st 
one tagged cysteinyl residue. The presence of the relatively rare cysteinyl residue In a 
peptide adds an additional powerful constraint for database searching (Sechi. S. et al. 
(1998). Third, tagging and selective enrichment of cysteine-containing peptides significantly 
reduced the complexity of the peptide mixture generated by the concurrent digestion of six 
proteins. For this protein mixture, the complexity was reduced from 293 potential tryjatic 
peptides to 44 tryptic peptides containing at least one cysteinyl residue. Fourth, the peptide 
samples eluted from the avidin affinity column are directly compatible with analysis by pLC- 
MS". 
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Quantitative analysis of protein expression in different ceil states 

' The protein reactive affinity reagent strategy was applied to study differences In 
steady-state protein expression in the yeast, S. cerevisiae. In two non-glucose repressed 
states (Table 3). Cells were han/ested from yeast growing in log-phase utilizing either 2% 
galactose or 2% ethanol as the carbon source. One-hundred pg of soluble yeast jDrbteiri 
from each cell state were labeled independently with the isotopically different affinity tagged 
reagents. The labeled samples were combined and subjected to the strategy described In 
Fig, 1 . One fiftieth (the equivalent of approximately 2 pg of protein from each cell state) of 
the sample was analyzed. 

Glucose repression causes large numbers of proteins with metabolic functions 
significant to growth on other carbon sources to be minimally expressed (Ronne, H. (1995; 
Hodges, P.E. et al. (1999)). Growth on galactose or ethanol with no glucose present results 
in the expression of glucose repressed genes. Table 3 presents a selection of 34 yeast 
genes encountered in the analysis, but It contains every known glucose-repressed genes 
that was identified (Mann, M. et al. (1994). Each of these genes would have been minimally 
expressed in yeast grown on glucose. Genes specific to both growth on galactose (GAL1 , 
GAL1 0) as well as growth on ethanol (ADH2, ACH1 ) were detected and quantitated. 

The quantitative nature of the method is apparent In the ability to accurately measure 
small changes in relative protein levels. Evidence of the accuracy of the measurements can 
be seen by the excellent agreement found by examining ratios for pnDteiris for which multiple 
peptides were quantified. For example, the five peptides found from PCK1 had a rtieari ratio 
±95% confidence intervals of 1.57±0.15, and the percent endr was <10%. In addition, the 
observed changes fit the expected changes from the literature (Ronne, H. 1 995; Hodges, 
P.E. et al. (1999)). Finally, the observed changes are in agreement with the changes In 
staining intensity for these same proteins examined after two-dimensional gel electrophoresis 
(data not shown). 

The alcohol dehydrogenase family of isozymes in yeast facilitates growth on either 
hexose sugars (ADH1) and ethanol (ADH2). The gene ADH2 encodes an enzyme that is 
both glucose- and galactose-repressed and permits a yeast cell to grow entirely on ethanol 
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by converting it into acetaldehyde which enters the TCA cycle (Fig. 5A). In the presence of 
sugar, ADH1 performs the reverse reaction converting acetaldehyde into ethanol. The 
regulation of these isozymes is l<ey to carbon utilization in yeast (Ronne. H. (1 995)). the 
ability to accurately measure differences in gene expression across families of isozymes is 
sonietimes difficult using cDNA array techniques because of cross hybridization (DeRisi, J.L. 
et al. (1997)). The method of this invention applied as illustrated in Fig.1 succeeded in 
measuring gene expression for each isozyme even though ADHt and ADH2 share 93% 
amino acid (88% nucleotide) sequence similarity. This was because the affinity tagged 
peptides from each isozyme differed by a single amino acid residue (valine to threonine) 
which shifted the retention time by more than 2 min and the mass by 2 daltons for the AbH2 
peptides (Fig. 5B). ADH1 was expressed at approxiniately 2-fold high levels when galactose 
was the carbon spurce compared with ethanol. Ethanol-induction of ADH2 expression 
resulted in more thaA2Q0-fold increases conripared with galactose-induction. 

The results described above illustrate that the method of this invention provides 
quantitative analysis of protein mixtures and the identification of the protein components 
therein in a single* automated operation. 

The method as applied using a sulfhydryl reactive reagent significantly reduces the 
complexity of the peptide mixtures because affinity tagged cysteine-containing peptides are 
selectively isolated. For example, a theoretical tryptic digest of the entire yeast proteome 
(6113 proteins) produces 344.855 peptides, but only 30.619 of these peptides contain a 
cysteinyl residue. Thus, the complexity of the mixture is reduced, while protein quantitation 
and identification are still achieved. The chemical reaction in of the sulfliydryl reagent with 
protein can be perfonned in the presence of urea, sodium dodecyl sulfate (SDS). salts and '■. 
other chemicals that do not contain a reactive thiol group. Therefore, proteins can be kept in 
solution with powerful stabilizing agents until they are enzymatically digested. The sensitivity 
of the MLC-MS"system is dependent of the sample quality. In particular, commonly used 
protein solubilizing agents are poorly compatible or incompatible with MS. Affinity purification 
of the tagged peptides completely eliminates contaminants incompatible with MS. The 
quantitation and identification pf low abundance proteins by conventional methods requires 
large amounts (milligrams) of starting protein lysate and involves some type of enrichment for 
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these low abundance proteins. Assays described above, start with about 100 pg of protein 
and used no fractionation techniques. Of this, approximately 1/50 of the protein was 
analyzed In a single pLC-MS" experiment. This system has a limit of detection of 10-20 fitiol 
per peptide (Gygi. S.P. et al. (1 999)). For this reason. In the assays described which employ 
pLC-MS" only abundant proteins are detected. However, the methods of this invention are 
compatible with any biochemical, immunological or cell biological fractionation methods that 
reduce the mixture complexity and enrich for proteins of low abundaride While quantitation is 
maintained. This method can be redundant in both quantitation arid identification if rtlultlple 
cysteines are detected. There is a dynamic range associated with the ability of ^e method^ 
to quantitate differences in expression levels of affinity tagged peptides which is depe^^^ 
on both the intensity of the peaks corresponding the peptide pair (or set) and the overall 
mixture complexity. In addition, this dynamic range will be different foF each type^b^^^^^^ 
spectrometer used. The Ion trap was employed in assays described herein because of ite 
ability to collect impressive amounts of sequencing infonnation (thousands of proteins can 
potentially be identified) in a data-dependerit fashion even though it offers a more limiteci 
dynamic quantitation range. The dynamic range of the ion trap (based bri signal-to-noise 
ratios) varied depending on the signal intensity of the peptide pair and complexity of the 
mixture, but differences of up to 1 00-fold were generally detectable and even larger 
differences could be detemiined for more abundant peptides. In addition, protein expression 
level changes of more than 100-200.fold sfill identify those proteins as major poterife^^^^ 
contributors to the phenotypic differences between the two original cell states. The method 
can be extended to include reactivity toward other functional groups. A sm^ll percentage of 
proteins (8% for S. cereyisiae) contain no cysteiriyi residues and are therefore missed by 
analysis using reagents with sulftiydryl group specificity (i.e .thibl grbup specificity). Affinity 
tagged reagents with specificities toward functional groups other than sulfhydryl groups will 
also make cysteine-free proteins susceptible to analysis. 

The methods of this invention can be applied to analysis of low abundance proteins 
and classes of proteins with particular physico-chemical properties including poor solubility, 
large or small size and extreme p/ values. 
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The prototypical applicatjon of the Chemistry and method Is the estab^^ 
quantitative profiles of complex protein sampleis and ultimately total lysates of cells and 
tissues following the prefen-ed method described above. In addition the reagents and 
methods of this invetion have applications which go beyond the determination of protein 
expression profiles. Such applications include the following: 

Application of amino-reactive or sulfhydryl-reactlve; differentially isotopically labeled 
affinity tagged reagents for the quantitative analysis of proteins in immuno precipitated 
complexes. In the prefenred version of this technique jarotein complexes from cells 
representing different states (e.g., different states of activation, different disease states, 
different states pf differentiation) are precipitated with a specific reagent, preferably an 
antibody. The proteins in the precipitated complex are then derivatized and analyzed as 
above. 

Application of amino-reactive. differentially isotopically labeled affinity tagged reagents 
to detennine the sites of induced protein phosphorylation. In a preferred version of this 
method purified proteins (e.g.. immunoprecipitated from cells under different stimulatory 
conditions) are fragmented and derivatized as described above. Phosphopeptides are 
identified in the resulting peptide mixture by fragmentation In the ioii source of the ESI-MS 
instrument and their relative abundances are determined by comparing the ion signal 
intensities of the experimental sample with the intensity of an included, isotopically labeled 
standard. 

Amino-reactive, differentially isotopically labeled affinity tagged reagents are used to 
identify the AZ-temiinal ion series in MS" spectra. In a preferred version of this application, the 
peptides to be analyzed are derivatized with a 50:50 mixture of an isotopically light and 
heavy reagent which is specific for amino groups. Fragmentation of the peptides by CID 
therefore produce two /V-tenninal ion series which differ in mass precisely by the mass 
differential of the reagent species used. This application dramatically reduces the difficulty ih 
determining the amino acid sequence of the derivatized peptide. 
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Quantitative Analysis of Surface Proteins in Cells and Tissue 

' The cell exterior membrane and its associated proteins (gbII surface ^p^^ 

participate in sensing external signals and responding to environmental cues. Changes in 
the abundance of cell surface proteins can reflect a specific cellular state or the ability of a 
cell to respond to its changing environment. Thus, the cpmprehenslve. quantitative 
characterization of the protein components of the cell surface can identify marker proteins or 
constellations of marker proteins characteristic for a particular cellular state, or ^Xfilain the 
molecular basis for cellular responses to external stimuli. Indeed, changes In expression of a 
number of cell surface receptors such as Her2/neu, erbB; IGFI receptor, and EGF receptor 
have been implicated in carcinogenesis and a current imhfiunologicat therapeutic apprb^ 
for breast cancer is based on the infusion of an antibody (Herceptin, Gehenteeh, Palp Altb. 
CA) that specifically recognizes Her2/neu receptor. 

Cell surface proteins are also experimentally accessible. Diagnostic assays for cell 
classificatldn and preparative isolation of specific cellis by methods such as cell sorting or 
pannirig are based on cell surface proteins, Thuis, differential analysis of cell surface ^ 
proteins between normal and diseased (e.g.. cancer) cells can identify important diagriokic 
or therapeutic targets. While the importance of cell surface proteins for diagnosis and 
therapy of cancer has been recognized, membrane proteins have been difficult to analyze, 
Due to their generally poor solubility they tend to be under-represented in standard 2D gel 
electrophoresis patterns and attempts to adapt 2D electrophoresis conditions to the 
separation of membrane proteins have met limited success. The method of this invention 
can overcome the limitations inherent in the traditional techniques. 

The analysis of membrane proteins is challenging because they generally are difficult 
to maintafn in solution under conditions that are compatible with high sensitivity Analytical 
instmments such as mass spectrometers. The application of the methods of the present 
invention to the analysis of membrane proteins is exemplifed using human T cell lymphoma 
cell line Jurkat for membrane protein labeling and extraction and the well characterized 
human prostate epithelial cell line P69SV40T and two P69SV40T sublines which differ in 
IGF-1 receptor expression by factor of 1 0 to exemplify quantitative, differential analysis of 
membrane proteins. 
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Jurkat cells are an appropriate model system because the cells are easy to grow in 
large numbers and because the modulation of cell surface proteins in response to different 
stimuli and experimental conditions has been well characterized in T lymphocytes. 
Commercially available biotinylating reagents or more generally affinity tagging reagents are 
employed to derivatize lysine residues and the free N-termini. Water soluble biotinylating 
reagents such as Sulfo-NHS (N-hydroxy succinimide) biotin and analogs (Sulfosuccinimidyl- 
6-{biotinamido)-hexanoate. Pierce. Rockford. IL) which have been used extensively for 
labeling cell surface proteins can be employed.: The reaction of NHS esters with primary 

amines Is best at neutral pH values and above and is compatible with the presence b^^ 
organic solvent such as DMSO or DMF. Biotinylation of cell surface proteins from the Jurkat 
cells Is candied ,out in PBS buffer at pH 7.2. Cells (1 x iO') are washed with PBS buffer to 
remove contaminating serum and other proteins from the culture medium, the cells are 
resuspended at25 x 10« cell/ml and reacted with 0.5 mg/ml bf Sulfo-NHS-Biotin (Pierce, 
Rockford. IL) for 30 min at RJ., The labeled cells are washed twice with cold PBS to remove 
unreacted biotinylating reagent. Biotinylated cells are sdlubilized 5 x 10' cells/ml In lysis 
buffer containing 1% Triton X-1 14. Triton X-1 14 has the property of phase-partitioning into 
detergent phase and aqueous phase at 30«C. Following the phase partitioning, detergent 
phase is removed from the aqueous phase by centrifugation at SOOxg. Phase partitioning 
has previously been successfully used to enrich ceir membrane. Also, this technique was 
found to enrich membrane proteins from Juricat cell lysates . Triton phase is diluted 1:5 (v/v) 
using 50 mM ammonium bicarbonate buffer. pH 8.5. and high-purity, modified porcine-trypsin 
is added to digest the proteins at a concentration of 12.5 ng/ml for overnight at 37°C. 
Trypsin is neutralized by the addition of a cocktail of serine protease inhibitors and tryptic 
peptides are isolated by the avidin affinity chromatography techniques. Eluted peptides are 
separated e.g,. by pLC methods and identified by searching peptide sequenc databases, 
using for example, the Sequest program. 

The human prostate epithelial cell line P69SV40T which was immortalized with SV 40 
T antigen has been well characterized . This cell line is immortal but not tuniorigenic and 
expresses type 1 insulin like growth factor receptor (IGF-1R) at 2 x 10^ receptors per cell. A 
subline, called M12, was derived from P69SV40T by sequential passage in male athymic 
nude mice . This cell line is highly tumorigenic and metastatic and expresses 1.1 x 10^ IGF- 
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1 R per cell. The relative difference in the abundance of IGF-1 R in the cell lines P69SV40T 
anil M12 can be quantitatively detemiined using methods of this invention adapted for 
application to membrane proteins. Since the number of IGF-1 R for these cell lines has 
already been detemiined, this well characterized system can provide a refererice to validate 
the efficiency of the quantitative methods of this invention 

P69SV40T cells (1 x 10') are biotinylated with an isotopically heavy bibtin tagged 
amino reactive reagent and the M12 cells (1 x 10') are biotinylated with a Corresponding 
isotopically light amine reactive biotin tagged amino reactive reagent: IGF-1 R is then 
immunoprecipitated from the combined lysate of both cell lines using an antlbbdy against 
human IGF-1 R and the total mass of immunoprecipitated proteins is digested with trypsin- 
Trypsin is then neutralizecJ, e,g.. by the addition of inhibitors and tagged peptides are purified 
by biotin-avidin affinity chromatography. The eluted peptides are analyzed by LC-MS and 
LC-MS^for peptide quantitation and icjentification, respectively, as has been described 
above. Quantitation in this experinient is facilitated by the option to us6 selective ion 
monitoring in the MS. In this mode only the masses of tagged peptide ions expected to 
d rive from IGF-1 R need be monitored. 

The described technique can be applied to compare the differences in relative 
abundance of cell surface proteins between, parental prostate cell line (P69SV40T) and Mi 2 
cells to detect and identify those cell surface proteins whose expression level isdiffererit in 
the two cell lines and which may be characteristic of the different cell states. UsinQ the 
methods described herein global, relative quantitation of the cell surface proteins in any two 
or more cell lines can be analyzed to detect and identify those cell surface proteins ' 
characteristic of the different cell states. Results can be independent confinned using 
procedure such as ID or 2D gels, if applicable, or quantitative western blotting to confimi 
quantitation results. 

It Is expected that the experirhental variability of quantitation of cell surface proteins 
will will be onsiderably better than the accuracy of quantitation achieved by cuirentiy 
available cDNA array technology. In addition to relative protein quantity and identity, the 
method can also be used to reveal the orientation of the protein in the membrane, based on 
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the presumption that intact, alive cells will exclude the biotinylating reagent. 

Alternative nnethdds can be applied to enhance the selectivity for tagged peptides 
derived from cell surface proteins. For example, tagged cell'surface proteins can be 
trysinized directly on the intact cells to generate tagged peptides, purified and analyzed as 
discussed. In addition, traditional cell membrane preparations may be used as an inltltal step 
to enrich cell surface proteins. These methods can include gentle cell lysis with a dourice 
homogenizer and series of density gradient centrifugations td isolate membrane proteins 
prior to proteolysis.. This method can provide highly enriched preparations of cell surface 
proteins. Affinity tagged proteins may also be isolated by affinity chromatography prior to 
proteolysis as well as after proteolysis. This chromatography can be perfdnned in the 
presence of surfactants such as TX-100, NP-40 or Tween-20 to maintain protein solubility. 
The sequential application of affinity chromatography steps (one for the intact protein and 
one for the tagged peptide fragments) provides a high degree of selebtlvity. These 
altrematlve methods are easily scalable for the detection of low abundance membrane 
proteins and the relative quantity of tagged peptides tagged is maintained through the 
selective enrichment steps. 

In the application of the methods of this invention to cell surface proteins, once the 
tagged proteins are fragmented, the tagged peptides behave no differently from the peptides 
generated from more soluble samples. 

Synthesis of affinity tagged protein reactive reagents that are selective for certain 
proteins groups 

Synthetic routes exemplary affinity tagged reagents suitable for use in the methods of 
this invention are provided in Schemes 2-3 where well-known synthetic techniques are 
employed in synthesis of the non-deuterated and deuterated reagents. . 

Bidtinyl-iodoacetylamidyl-4.7.10 trioxatridecariediamine 4 (Scheme 3) consists of a 
biotin group, a chemically inert spacer of capable of being isotopically labelled with stable 
isotopes and a iodoacetamidyl group, respectively. The biotin group is used for affinity 
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enrichment of peptides derivatized with the reagent, the ethylene glycol linker |s differentially 
isdtopically labeled for mass spectral analysis and the iodoacetamidyl group provides 
specificity of the reagent for sulfhydryl-containing peptides. The reagent an be synthesized 
in an all hydrogen fomn (isotopically light fomi) with and with 1-20, and preferably 4-8 
deuterium atoms in the linker (isotopically heavy forms). 

Analysis of Velocities of Multiple Enzymes in Cell ^ 

Monitoring enzyme functions by biochemical assays is an essentiardiagnostlc tool 
that employs a multitude of analytical techniques including spectrophotometnc. fluprdmetric. 
and radiometric detection of products. However, cun-ent methods are difficult to Use for 
assaying several enzymes simultaneously In a single sample. Mass spectmmetry for 
quantification of a collection of metabolites in biological fluids has emerged as a powerful 
approach for the analysis of birth defects (Moms et al., 1994^ 

has not been developed for the direct analysis of rates of individual ^nzymatfe steps. The 
analytical method described herein for monitoring and quantification of enzymatic dctiviti^s in 
cell homogenates and other biological samples permits simultaneous (multiplex) mbnitorihg 
of multiple reactions, and can be readily automated. 

A feature of the method of this invention as applied to enzyme assays is the use of 
electrospray ionization mass spectrometry (ESI-MS) (Cole et al., 1997) for the simLite 
detection of enzymatic products and chemically identical internal standards, which are 
distinguished by stable isotope (deuterium) labeling. A second feature is the use of affinity 
tagged reagents containing ah enzyme substrate which when combined with affinity 
purification provide for facile capture of enzymatic products from crude biologicai fluids: The 
affinity tagged reagents are designed to contain a target substrate for an enzyme of Interest 
that is covalently attached to an affinity tag via a linker. Action of the enzyme of interest on 
the substrate conjugate causes cleavage or other modification that chariges its molecular 
mass (Scheme 4). The change of mass is detected by ESI-MS. Th6 linker and and affinity 
tag used preferably facilitate ionization by ESI. block action of other enzymes in the 
biological fluid, and allow highly selective capture from the complex matrix for facile 
purification. 
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An example of this approach is the design and synthesis of affinity tagged enzyme 
substrate reageants 1 and 2 (Scheme 5) to simultaneously assay lysosomal (Halactosidase 
and N-acetyl.q-D-glucosaminidase. respectively. Deficiency of the former enzyme results in 
one of the lysosomal storage diseases. GM,-gangllosidosis, a condition that occurs in the 
population with a frequency of about 1 in 50,000 and leads to early death of affected 
children.. Deficiency of N-acetyl-R-D-glucosaminidase results in the rare lysosomal storage 
disorder Sanfilippo syndrome type B. This example has been described in Gerber et al. 

(1999)J. Amer. Chem. Soc. 121: 1102-1103 which is incorporated by reference herein in 
its entirety. 

Conjugates 1 and 2 consist of biotin as an affinity tag. which is coupled to sarcbsine. 
Blotin allows highly specific capture of the substrate conjugate through non-covalent binding 
to streptavidin immobilized on agarose beads (Bayer et al- i 990)., Sarcosine provides an N- 
methylated amide linkage to biotin to block the enzyme biotinidase, which is often present in 
the cellular fluids and could cause cleavage of the conjugate molecule during th6 assay 
(Wilbur et al.. 1997). In addition, it was found that biotinyl-sarcosine conjugates can be 
displaced from streptavidin by addition of biotin. The N-biotinylsarcosihe block is linked to a 
polyether diamine, the length of which can be varied to avoid mass/charge overlaps of 
products and internal standards. The linker also allows facile introduction of multiple ' 
deuterium atoms (i.e.. 8 deuteriums in 5 and 4 in 6. Scheme 5) to permit the synthesis of 
internar standards. The d8-llnker was made by reacting DOCH2CH20CH2CH20D with 
CD2 dCDCN in benzene with catalytic NaOD (Ashikaga. K.; Ito. S.;Yamamoto. M.; Nishijima. 
Y. Bull. Chem. Soc. Jpn. 1988. 61 . 2443-2450) and the resulting dinitrile was reduced to the 
diamine with Ra-Ni. The d4-linker was made in the. same way using ethylene glycol arid CD2 
dCDCN in CH3CN and catalytic NaOH. 

In addition, the linker is hydrophilic to ensure good water solubility of the substrate 
conjugate, and it has basic groups which are efficiently protonated by ESI and thus ensure 
sensitive detection by mass spectrometry. The target carbohydrate substrates are attached 
to the polyether linker by a p-alanine unit (Scheme 5). The enzymatic product conjugates 3 
and 4 are also shown Spheme 5. Conjugates 1 and 2 were prepared as shown in Scheme 5. 
All reagents were purified to homogeneity by reverse-phase HPLC arid characterized by 
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high-field 1 H NMR and ESI-MS. The substrate was linked to the diamine spacer by Mich 
addition of the latter onto the p-acryloylamidophenyl glycoside. (Romanowskaet al.. 1994) 
and the intermediate was coupled with the tetrafluorophenyl ester of N-biotinylsareoslne 
(Wilbur etal., 1997). 

The ESI.MS assay of P-galactosidase and N-acetyi-R^D-gliicosaminidase is based on 
enzymatic cleavage of the glycosidic bond to release monosaccharide and conjugates 3 and 
4 (mass differences are 162 and 203 Da. respectively). In a typical procedure. 0.2 mM 1 and 
0.3 mM 2 were incubated with sonicated cultured fibroblasts from individual patients with > 
galactosidase deficiency and with fibroblasts cultured from unaffected people. After 
incybatlon, labeled intemal standards 5 and 6 were added, and the blotinylated components 
w re captured on streptavidin-agarose beads. Quantitative strepavidin capture efficiency 
from a cell homogenate was observed with model reagents. After purification by multiple 
washings to remove npnspecifically bound components, the blotinylated products were 
released by free biotin, and the eluant was analyzed by ESI-MS. About 85% release of the 
blotinylated products was observed after incubation with excess biotin for 90 min. A blank 
was obtained by quenching the assay with all components present at time zero. 

A typical procedure, cell protein (75 pg) in 15 pL of water was added tb 15 pL buffer 
(0.1 M Na citrate. pH 4.25) containing 2 (0.3 mMyartd 1 (0.2 mM. added 5 h after addition of 
cell protein). After incubation for 5.5 h at 37 'C. the reaction was quenched by addition of 
200 ML of 0.2 M glycine carbonate buffer. pH 10.3. and 5 and 6 (1 nmbl each) were addbd. 
After centrifugation to remove cell debris, the supernatant was loaded onto a bed of 
streptavidin-agarose (7 nmol biotin binding capacity. Pierce) in a small filtration device (micro 
BioSpin. Bio-Rad). After 5 min, filtration was effected by centrifugation. and the gel bed was 
washed with 0.1% Triton X-IOO ( about 1 min incubation, then spin) and then six times with 
purified water (Milli-Q, Millipore). Elution was canned out in 25 pL of 50% methanol containing 
56 nmol of free biotin (1-10 h incubation, then spin). Filtrate was diluted 4-f6ld with 50% 
methanol/water. and 1 pL was analyzed by ESI-MS. 

The ESI-MS spectrum of the blank (Figure XA) is remarkably simple, showing peaks of 
the (M + H)* ions from reagents 1 and 2 (m/z 843 and 840). internal standards 5 and 6 (mfz 
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689 and 641 ). and trace amounts of products 3 and 4 (mfz 681 and 637). Ions due to 
clusters of biotin also appear in the spectrum but did not Interfere with the analysis. The 
presence of nondeuterated products in the blank may be due to norienzymatic substrate 
reagent hydrolysis during sample work up or to collision-induced dissociation of the substrate 
ion in the gas phase. A MS/MS spectrum of the (conjugate 1 + H)* ion at m/z 843 gave a 
prominent fragment of (conjugate 3 + H)* at m/z 681 (spectrum not shown). The ESI-MS 
spectrum of a sample incubated with cell hombgenate from a healthy individual clearly shows 
the p-galactosidase product at m/z 681 and the N-acetyl-a-D-glucosaminidase product at 
m/z 637 (Figure XB). Triplicate enzymatic reactions using cells from a healthy patient yielded 
a p^alactosidase specific activity of 51 ± 3 nmol/h/(mg cell protein) and an N-acetyl-a-D- 
glucosamlnidase specific activity of 1.4 ± 0.3 nmol/h/mg. Time course studies confirmed that 
the initial reaction velocities were being measured. Values obtained with cells from six 
healthy individuals ranged from 36 ± 4 to 68 ± 3 nmol/h/mg for 3-galactbsidase arid 0.9 ± 
0.05 to 2.3 + 0.4 nmol/h/mg for N-acetyl-a-D-glucosaminidase. In contrast, very litUe 
enzymatic product above the blank level (0.9 ± 0.9 and 0.8 ± 0.6 nmol/h/mgj vvas observed 
when cells from two patients with P-galactosidase deficiency were used, whereas N-acetyl-a- 
D-glucosaminiKlase activity is cleariy detected (Figure XC). These spectra were obtained 
with 0.75 pg of cell protein, corresponding to - 1000 fibroblasts. Thus the ESI-MS method 
has very high sensitivity for biomedical applications. 

ESI-MS was carried out on a Finnigan LCQ ion trap instrument. Data were collected in 
full scan mode from m/z 625 to 875 by direct infusion at 1 .5 |jL/min. Specific activities were 
obtained frorri the ratio of product to internal standard ion peak areas (averaged over 30 
scans). 

The approach described for assaying enzymes using substrate reagents and ESI-MS 
can be broadly applied. The multiplex technique can be expanded to assay dozens or more 
enzymes simultaneously in a single reaction, obviating the need for multiple assays to assist 
in confimriing diagnoses of rare disorders. The method can be used to measure several 
enzymes simultaneously when evaluating the rate of chemical flux through a specific 
biochemical pathway or for monitoring biochemical signaling pathways. The affinty tag- 
capture reagent method for isolation of affinty tagged reaction products and substrates from 
complex mixtures is technically simple and can be readily automated, particular when biotin- 
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strepavidin is employed. Because of the high sensitivity of the ESI-MS detection employed, 
which requires only sub-microgram quantities of the substrate reagents per assay, the 
synthesis of seyeral hundred substrate reagents on a low-gram scale becomes practical and 
economical. Since most enzyme active sites are exposed to solvent, it is possible to attach 
an affinity tagged linker to most enzyme substrates while preserving enzymatic activity. 
Scheme 6 provides the structures of several additional enzyme substrates, suitable for use in 
this method, indicating by arrows allowatsle positions for tag attachment sites. Allowable tag 
sites for addrtional enzyme substrates can be detemiined by review of X^ray crystal 
structures ofenzyme-substrateor enzyme-substrate analog structures. Using a standard 
computer graphics program . available X-ray data and by attaching an extended chain biityl 
group (as a model for the affinity tagged linker) to potential tag attachment sites, suitable 
attachment sites that show there are no enzyme.atoms in van def Waals overlap with the 
model tag can be predicted. 

Analogous methods to those described above can be applied to the analysis of 
enzymes associated with other Sanfillipo Syndromes (A. C and D). SFA is associated with 
heparan sulfamidase. SFC is associated with acetyl-CoA-alpha-gjucosaminide N- 
acetyltransferase and SFD is associated with N-acetylglucosamine 6-sulfatase. Exemplary 
affinity tagged enzyme substrate reigents useful in the analysis of these enzymes and the 
diagnosis of these disorders are provided below. The methods can aiso be applied of the 
diagnosis of Niemann-Pick Type A and B disease by assaying for acid sphingomyelinase 
and to the diagnosis of Krabbe disease by assaying for galactocerebroside beta- 
glac3tosidase. These enzymes are currently assayed employing fluorphore-derivatived 
reagents as indicated in Scheme 7. Enzyme substrate reagents for assay of these enzymes ' 
in the methods herein can be readily prepared by replacement of the fluorophore with an A-L 
group herein. This approach to preparation of affinity tagged enzyme substrates is generally 
applicable to any known fluorophore -derivatized enzyme subtrate or substrate analog: 

Table 4 provides exemplary enzymes that are associates with certain birth defects or 
disease states. These enzymes can be assayed by the methods described herein. 
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Assaying Enzymatic Pathways for Carb hydrat -Deficient Glycoprotein Syndromes 
(CDGS) 

The methods and reagents of this invention can be employed to quantify the velocities 
of multiple enzymes pertinent to diagnosis of CDGS diseases. 

CDGS Type la and lb are caused by the deficiency or absence of the enzymes 
phosphomannoisomerase (PMIb) (Type lb) and phosphomannomutase (PMM2) (Type la) 
which are part of a multistep pathway (Scheme 8) for conversion of glucose to mannose-l- 
phosphate (Freeze. 1998). The monosaccharide substrates involved in the pathway are 
fmctose^e-phosphate. mannose-6-phosphate. and mannose-1 -phosphate. These 
monosaccharides can be somewhat difficult to convert to substrate conjugates because it is 
not a priori clear which atom on the sugar should be conjugated with the linker without 
impairing enzyme activity. PMIb and PMM2 cdn, however, be assayed indirectly. 
Mammalian cell microsomes contain do)ichol-P-mannose synthase which catalyzes the 
reaction of dolichol-phosphate with GDP-mannose to form dolichol-P-mannose and GDP 
(Scheme 8, Chapman et al. 1980). This synthase can be assayed using the methods of this 
invention, specifically with a biotln-linker substrate. Microbial PMM and the enzyme which 
makes GDP-mannose from GTP and mannose-1 -P, GDP-rhannose pyrophosphorylase. are 
readily purified from bacteria and yeast (Glaser. 1966, Preiss, 1966), and these enzymes can 
be supplied exogeneously to the enzyme assay. If PMIb activity is deficient, little or no 
mannose-6-P will be made when the reaction sequence is started by addition of fructose-e-P! 
Without mannose-6-P, mannose-1 -P and GDP-mannose will not be made, and thus no 
conjugated-dolichol-P-mannose will be detected by ESl-MS. Exogenous GTP is supplied as 
a requirement for the GDP-mannose pyrophosphorylase step, and ATP is omitted so that 
mannose-6'P cannot be made from marinose. To assay PMM2, the reaction sequence is 
initiated with mannose-6-P. and PMM2 deficiency results in the failure to make conjugated- 
dolichol-P-mannose. 

The carrier dolichol is a -60- to 1 05-carbon isoprenoid. Evidence is accumulating that 
many enzymes that operate on carbohydrates attached to dolichol chains are tolerant to 
significant shortening of the dolichol chain; even 1 0- and 1 5-carbon dolichols are tolerated 
(Rush and Wachter. 1995). It appears that such enzymes act on the water-soluble 
cart3ohydrate portion of the dolichol conjugate and thus have little or no requirement to bind 
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the dollchol anchor. Based on this, an affinity labeled substrate for the direct assay of 
dollchol-P-mannose synthase and the indirect assay of PMIb and PMM2 is prepared by 
attaching an affinity labeled linker to the non-polar end of a short dolichol. such as the 10- 
carbpn dolichol analog citronellol. 

The synthesis of a biotinylated dolichol.o-substrate conjugate containing a sarcosihyl 
linker (B-S-Dol,0-P) Is shown in Scheme 9. Protected citronellol (R = /-BuSiMCa) is 
regloseleclively oxidized at the tenninal alyllic methyl group (McMurry and Kocovsky. 1984), 
and the allylic alcohol is coupled with biotinylsarcosine active ester (R =: GH,). The citroneliol 
1-hydroxy group is subsequently deprotected and phosphorylated with POCIj CRush and 
Wachter. 1 995). In a parallel synthesis. d,-sarcosine. CD3NHCD2COOH. is used to prepaffe 
the isotopically labelled (heavy) reagent for use as an internal standard, d^- Sarcosine is 
readily prepares form commerically available materials (BrCD^pOOD and CDjNHj) using 
standard synthetic techniques. 

The deuterated intemal standard. B-ds-S-Dol^o-P-Mannose. is synthesized 
enzymatlcally by incubating hen oviduct microsomes with GDP-mannose and the synthetic 
B-d5-S-Dol,o-P substrate conjugate (Rush and Waechler, 1995). An added advantage of th 
B-S-conjugate is that it allows for a facile affinity purification of the microsomal mannosylated 
product by specific capture on agarose-streptavidin beads followed by elutlon with free biotln. 

This method employing affinity tagged short dolichol analogues is generally : 
applicable for assaying other enzymes that operated on dolichol anchored carbohydrates 
Such an approach is useful for the subsequent identification of enzyme deficiencies present 
in other types of CDGS that have not been yet identified. 

CDGS Type II results from defective GlcNAc transferase II (GlcNAc-T II) which 
transfers GIcNAc from UDP-GIcNAc to the 2-position of a mannose residue In the 
Intemnediate branched oligosaccharide (the Core Region) in the process of building upth 
disialo-biantennary chain (Scheme 1 0) (Schachter. 1 986. Brockhausen et al. 1 989). GleNAc 
transferase 11 Is one of the six known enzymes that mediate highly reglospecific glycosylatibn 
of the mannose residues in the Core Region. The Core Region Is anchored at the reducing 
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end to chitobiosylasparagine, where the asparagine residue is part of the peptide chain of 
the glycosylated protein. The latter staicture unit in the substrate can be replaced by a 
hydrophobic chain without loss of enzyme activity (Kaur et al, 1 991 ). Thus, the substrate 
conjugate for CDGS Type li is assembled by linking a affinity-labelled linker group to the 
reducing end to chitpbipsylasparagine. However, the latter structure unit in the substrate can 
be replaced by a hydrophobic chain without loss of enzyme activity (Kaur et al, 1991 ). For 
example, commercially available a-D-manno-pyranosylphenyllsothiocyanatecan be boupled 
to a biotin-labelled linker and the 5,6-hydroxyls are selectively protected as illustrated in 
Scheme 11 (Paulsen and Meinjohanns. 1992). Coupling of the equatorial 3-OH. with per-O- 
acetylmannosyl-1-trichloroacetamldate (Paulsen et al, 1993) will provide a disaccharide 
conjugate (Scheme 12 ). If a minor amognt of coupling occurs at the axia| 2-OH group the 
products can be separated by HPLC. After deprotection, the primary 6-OH is coupled vyith a 
second equivalent of per-O-acetylmannosyl-l-trichlorpacetamidate to yield the Core Region 
conjugate. Deprotection of the O-acetyl groups yields the substrate conjugate for GlcNAc 
transferase I which can be converted to the QlcNAc-T II substrate by enzymatic glycbsyl 
transfer using a Triton X-IOO rabbit liver extract, a reaction that has been earned but on a 
preparative scale (Kaur et al. 1981 ). 

The synthesis of the deuterium labeled derivative needed for the internal standard is 
perfonned in parallel by using a labeled PEG-diamine building block (Gerber et al. 1999). 
The biotinylated trisaccharide is converted to the tetrasaccharide (product of GlcNAc-T ll) by 
incubation with UDP-GlcNAc and transferase II ( (Kaur and Hindsgaul. 1991. Tan et al. 1996) 
and utilizing the B-S handle for affinity purification of the enzymatic products. 

CDGS Type V 

The lipid-linked bligosaccharide (LLO) that is transferred to the Asn residue of the 
glycosylated protein is composed of 2 GlcNAc. 9 mannoses, and 3 glucoses. It has recently 
been shown that microsomes from CDGS type V patients are greatly deficient in the enzyme 
that transfers one or more glucose residues during LLO biosynthesis (Komer et al, 1998). 
Since the transferase that attaches the carbohydrate unit of LLO to the Asn residue 
discriminates against the glucose-deficient LLO. CDGS Type V patients have fewer numbers 
of carbohydrate units atached to glycoproteins, such as transfenin (Komer et al. i 998). 
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However, the few carbohydrate units that are present are full-length, demohstrating that 
residual glucosyl transfer occurs in type VCDGS patients (Korner et al. 1998). Thus, 
quantification of the rate of Asn glycosylation by ESI-MS would constitute a viable assay of 
CDGS Type V syndrome. 

Synthetic peptides with 3-7 amino acid residues containing the Asn-Xaa-SerTThr 
sequence have been shown to be good substrates for glycosylation (Ronin et al., 1981). 
Th^ strategy for the ESI-MS assay of the bllgosaccharide transferase relies on a B-S 
conjugate of an appropriate peptide containing the Asn-Xaa-Sef/Thr sequence (SchemeiS). 
A heptapeptide. NH^-Tyr-Gln-Ser-Asn-Ser-Thr-Met-NH^ has shown high activity in a 
previous study (Ronln et al. 1981 ). The peptide Is readily available by standard peptide 
synthesis using an in-house automatic synthesizer. The heptapeptide and Its 
glycoconjugates cgn be ionized by ESI to provide stable singly-charged Ions. Coupling of 

BS-tetrafluorophenyl ester with NH^-Tyr-Gln-Ser-Asn^er-Thr-MetlNH^ will dir^^^^^^ yield the 
substrate for the transferase. Several products are expected from the enzymatic 
glycosylation and subsequent modifications of the oligosaccharide antenna. The products 
can be prepared enzymatically by incubating thyroid rough microsomes with BS-Tyr-GIn- 
Ser-Asn-Ser-Thr-Met-NHj and Dol-P-Glu (Ronin et al. 1981a). followed by affinity 
purification of the biotlnylated products. Product distribution due to different degrees of 
glycosylation can be monitored by ESI-MS, and the major components can be purified by 
HPLG. An analogous procedure using a B-NCCDg^CDjCO- conjugate is used to prepare 
deuterated internal standards. 



The molecular masses of the ionized substrate conjugates for the set of enzymes 
assayed for CDGS la. lb. II. and V syndromes, as well as products and internal Standards 
are compiled in Table 5. which shows that no isobaric overiaps among the (M + H)^ species 
occur. The close spacing between the (M + Na)* ion from the Type la.b product and the (M + 
H)* ion of the demannosylated B-(N.C,D,)-2.2-D,-Gly.Dbl,o.P internal standard can be rfeadily 
avoided by adjusting the ESI-MS conditions by addition of Na* ions to generate the gas 
phase Ions as Na-adducts. 

All three of the targeted enzymes can be analyzed simultaneously in a single 
biological sample, such as a cell lysate. The PMM2 and PMIb cannot be assayed 
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Simultaneously because they require the addition of different exogeneous substrates. 
Nevertheless, two assays using identical ESi-MS techniques can be used for diagnosing the 
various CDGS types instead of relying on a battery of different methods. 

Clinical Enzymology Assays 

A fibroblast cell pellet is thawed on ice. Sufficient 0.9% NaGI is added tb give a 
final protein concentration in the lysate of ~ 5mg/ mL (typically 100 mcL). and 
the cell pellet is sonicated in Ice water 5 times for 2 seconds each at moderate 
power. Total protein is determined spectrophotmetricaliy using the BCA reagent 
(BCA Protein Assay kit, Pierce). 

The total enzyme reaction volume is 20 Q 30 mcL: The substrata stock solutions " 
are maintained at concentrations of 3mM (SFB) and 2mM (GM1 ). These concentrations 
were measured by 1 H-NMR signal integratin versus an intemal standard (formamide 
proton of DMF). Fin^l concentration of substrates is 0.3 and 0.2 mM. respectively. A - 
volume of reaction buffer (e.g. 200mM sodium citrate. pH 4.5) equal to the difference of 
the substrate addition (2-3 mcL) plus sufficient cell samplfe volume to equal 50 Q 75mcg 
total protein from 20-30 mcL is added to a 0.5 mL Eppendorf tube, followed by substrate^ 
The sample is cooled on ice. and patient cell sample is added. The reaction is initiated by 
incubation at 37 deg C. 

For SFB: The reaction is allowed to proceed for 4.5 Q 6 hours, after which GM1 
substrate can be added or the reaction can be quenched with 100 mcL of 20OrnM glycine- 
carbonate buffer, pH 10.5. 

For GM1 : The reaction is allowed to proceed for 0.5 hours. Quenching is as for 

SFB. 

After quenching, the samples are placed on ice. Interna! standards are added 
(1 nmol each. I.e. 50 mcL of a 0.02 mM solution). The samples are microfuged at 
-1 5.000 rpm for 2 min at room temperature to pellet cell debris. Streptavidin-Agarose 
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beads (Immunopure immobilized streptavidin. Pierce) are placed in a micro bio^spin 
crVromatography column (Bio-Rad). Sufficient beads are add6d to give a total biotiri 
binding capacity of 5 nmol (typical binding capacity tpOpmol per mcLofbeadsas 
determined by Pierce). The sample supernatant is transferred to the bio-spin tube and 
allowed to bind for 10 minutes at room temperature. The sample is spun at -3.000 rpm to 
remove excess supematant. then washed once with 0.01% Triton X-IOO and at least five 
times with purified water, spinning the tube in-between to remove soltitlon. For each 
wash, sufficient wash solution is added to fill the bio-splh tiibe. 

The purified beads are then treated with 30 mcL purified water, folloWed by 10 
mcL of a 4mM biotin solution. The tubes are capped at the bottoril to prevent leakage and 
allowed to incubate at 2-8*C for 2-12 hours. The samples are spun at -3,000 rpm to elute 
th sample into a clean Eppendorf tube. 

The sample is then diluted with 60 moL of 50% methanol/water and infused Into 
th ion-trap mass spectrometer. The ESI-MS spectrum is tuned to reduce non-specific 
cl avage of the samples by first analyzing a blank sample (cell lysate added after reaction 
quench). The infused sample is analyzed by ion Ghromato0ram integration of a 1 amu- 
wide window about the (M + H+)+ ions of product and internal s^^^ 

Results are reported In nmol product formed/hour of incubation/milligrams total 
protein in reaction mixtures 

Clinical Analysis of Patient Samples for GM1 and SFB 

Patient skin fibroblasts were obtained as frozen pellets, and stored at 020 deg C until 
use. Two GM1 affected samples and six nonnal controls were analyzed. 

50 mcL of 0.9% NaCI was added to each patient cell pellet. The pellets were 
lysed by sonication in ice water 5x for 2 seconds each at moderate sonication power; 
chilling the microtip in ice vyater in between sonicatlons. 
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Samples were quantitated by BCA (Pierce) assay as follows: 

Reagent A and B were mixed in 50:1 ratio as described. A protein standard curve 
was prepared using bovine serum albumin as a standard at concentrations of 2, 1, 0.5. 0.2, 
and 0.05 mg/ mL. A portion of the patient sonicates were diluted 1 :1 5 in water, and 5 
mcL of each diluted paUeht sample and standard curve point was added to separate glass 
culture tubes containing 200 mcL water. In duplicate. This samples were theri diluted with 
1 mL of the mixed BCA reagent, vortexed to mix, and incubated at 37 deg G for 60 
minutes. The samples were allowed to cool to room temperature, and analyzed against a 
blank containing only 200 mcL water. The samples were analyzed by monitoring 
absorbance at 562nm in polystyrene cuvettes. Average patient absorbance values were 
blank con-ected and compared to standards via linear regression. ^ 

The patient protein concentrations were determined to be: 

1. (Affected) 12.2mg/ mL. 2. (Normal) lO.Smg/ mL.,3. (Normal) 1 1.9mg/ mL, 4. 
(Nomrial) 12.1mg/ mL. 5. (Normal) lO.Smg/ mL, 6. (Normal) 7.79mg/ mL. 7. (Normal) 
15.7mg/ mL. 8. (Affected) 1 1 .4mg/ mL 

Incubations were performed in a total of 30 mcL of total volume. The amount of 
reaction buffer (200mM sodium citrate. pH 4.25) added to blank Eppendorf tubes was the 
difference of the substrate volume (3 mcL of each substrate stock solution. 2mM for 
GM1 and 3mM for SFB. for a total of 6 mcL) plus the volume of cell lysate required to 
equal 75mcg total protein, from 30 mcL. For example, patient 1 . incubation mixture 
initially contained 3 mcL of SFB substrate solution. 6. 1 4 mcL patient cell lysate. and 
1 7.86 mcL reaction buffer. The GM1 substrate was added later in the incubation (see 
beloW). 

• 

Each patient sample was analyzed in triplicate. The reaction mixtures were kept 
on ice during preparation, and the reaction was initiated by transfer to a 37 deg C water 
bath. 5.5 hours into the incubation. 3 mcL GM1 substrate was added to each reaction, and 
after an additional 0.5 hours the reactions were placed on ice and quenched with 200 mcL 
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of a 200mM glycine-carbonate buffer, pH 10.25. 

The purification and analysis procedures are as des;cribed in Clinical 
Enzymology Assay (Typical). 

The resultant enzyme activities, as ah average standard deviation nrhol 
product/hour incubation/mg total protein: 



B-Gal - SFB 



Normals 


1 RATE 


+/-SD 


Rate 


+/-SD 1 


Patient 2 


68.0 


2.6 


0:90 


0.05 


Patients 


■35.5 ■■■ 


3.9 


1.54 


0.38 ' 


Patient 4 


51.1 


2.7 


1,36 


0.26 


Patient 5 


38.8 


8.3 


1.01 


0.12 


Patients 


51.4 


9-9 


2.25 


0.36 


Patient 7 


40.9 


3.7 


1.12 


0.20 


Affecteds 










GM, (#1) 


0.9 


0.9 


0.80 


0.21 


GMi (#8) 


0.8 


0.6 


0.70 


0.20 
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The following synthetic method refer to Schemes 14-23 

Synthesis for GM1-gangliosidosis (beta-D-galactosidase deficiency) 

1 . 2,3,5.6-Tetrafluorophenyl trifluorbacetate (1) 25g (0.15mol) 2,3,5,6- 
tetrafluorophenol, 35 mL (0.2mol) trifluoroac6tic anhydride and 0.5 mL 
boron trifluoride etherate were refluxed for 18 hours under argon 
atmosphere. Trifluoroacetic anhydride and trifluoroacetic acid wei"e 
removed by distillation at room temperature. The trifluoroacetic 
anhydride fraction was retumed to the mixture, and the reaction was 
refluxed for-24 hours. This was repeated twlc6. After final distlllatioh at 
room temperature, the desired product 1 was distilled at reduced 
pressure (62?C/45mmHg) to produce a coloriess liquid (30g, 82%). i H- 
NMR. (Ref. Gamper. H. B., Nucl. Acids Res., v21 pp145-1 50) 

2. Biotin-2.3,5,6-tetrafluorophenyl ester (2) A 2.5g (1 0.3mmol) quantity of 
d-biotin in 20 mL anhydrous DMF under argon atmosphere was 
wanned to 60?C with stirring to effect dissolution, 1.7 mL (12.5mmol) 
triethylamine was added, followed by 3.4g (1 2.5nnmol) 1 . The mixture 
was stirred for 2 hours, after which the solvent was removed by rotary 
evaporation. The resultant semi-solid was triturated With 1 5 nlL ether 
twice to produce a white solid (2.6g, 65%). 1 H-NMR; (Ref. Wilbur, D. 

S., Bioconj. Chem., v8 PP572-584) 

3. N-methylglycylbiotinamide-methyl ester (3) A 2.5g (6.4mmol) quantity 
of blotin tetrafluorophenyl ester in 30 nriL anhydrous DMF under argon 
atmosphere was added to a mixture of 1.1g (7;7mmol) N-methylglycine 
methyl ester hydrochloride dissolved in 10 mL anhydrous DMF and 

1 .25 mL (9.0 mmdl) triethylamine. The reaction nriixture was stinred at 
room temperature for 2 hours, then the solvent was removed by rotary 
evaporation. The residue was extracted with chorofomri (2x1 00 rhL), 
washed with water (2x20 mL), and drieid with anhydrous sodium 
sulfate. The solvent was removed under vacuum to yield 2.1g (98 %) 
of methyl ester of N-methylglycine biotinamide as an oil. 1 H-NMR. (Ref. 
Wilbur, D. S., Bioconj. Chem., v8 pp572-584) 
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4. N-methylglycylbiotinamide acid (4) N-Methylglycylbiotinamlde methyl 
ester was hydrolyzed in a mixture of 31 mL MeOH and 10 mL of IN 
NaOH at room temperature with stirring for 1 hour. The mixture was 
diluted with 50 mL 50% MeOH/water and neutralized with cation 
exchange resin, hydrogen fonn (AG MP-50. BioRad). The solution was 
filtered, the resin washed (3x50 mL) with 50% MeOH/water, and the 
solvents removed by rotary evaporation to yield 1 .6g (90%) of N- 
methylglycylbiotinamide acid as an off-white solid. 1H-NMR. (Ref/ 
Wilbur, D. S., Bioconj. Qhem., v8 pp572-584) 

5. p-Acrylamidophenyl-?-p-gaJactopyranoside (5)40mg (0.15mmbi) p^ 
aminophenyl ?-p-galactopyranoside was added to 25 mL methanol 
and 200 mcL triethylamine with stirring, the solution vyas chilled in ah 
ice bath. 53.3mg (O.emmol) acryloyi chloride was dissolved In 5 mL dry i 
methylene chloride and added dropwise to the stinred solution over 5 
minutes. The reaction was allowed to return to room temperature, 
followed by 2 hours of stining. The solution was then treated with 
successive anion and cation exchange resins (AG MP-1 and AG MP- 
50, respectively, BioRad) until neutral pH was obtained with moist pH 
paper. Solvent was removed by rotary evaporation to yield a solid 
(43mg. 90%). 1H-NMR. (Ref. Romanowska, A., Methods EnzymoL, 
v242 PP90-101) 

6. Michael addition product of 4,7,1 0-trioxa-1 , 1 3-tridecanediamine and 5 
(6) 20mg (0.07mmol) 5 was added to a stirred solution of 80mg 
(0.35mmol) 4,7, 10-trioxa-1,1 3-tridecanediamine in 5 mL 0.2M sodium 
carbonate, pH 10.5 at 37?C. The reaction was allowed to proceed for 3 
days, after which the solution was neutralized with dilute trifluoroacetic 
acid and purified by reverse-phase HPLC (Vydac G-l 8 prep-scale 
column, 6 mL/min. Mobile phase: H20 (0.08%TFA)/ AON (0.08%TFA)) 
to give 7.3 mg of product. (Ref. Romanowska, A., Methods Enzymol., 
v242 PP90-101) 

7. GM1 substrate conjugate of 4 and 6 (7) A 2.5mg (7.4mcmol) quantity 
of 4 was dissolved in 1 .5 mL anhydrous DMF with stining, under argon 



44 



wo 00/11208 



PCt/US99/19415 



atmosphere. 5 mcL triethylamine was added, followed by 2.3mg 
(8.8mcmpl) 1 . The formation of active ester was monitored by silica 
TLC (5:1 CHCI3/CH30H. Rf 0.5, UV) by briefly drying the spotted TLC 
plate with a stream of air. After 25 minutes, the mixture was added to 
3.2mg (5.9mcmol) 6 in 1 mL anhydrous DMF. After 2 hours, the 
solvent was removed by vacuum centrifugation and the final product 
was purified by reverse-phase HPLC (Vydac C-18 prep-scale column. 
6 mL/mln. Moblle phase: H20 (0.08%TFA)/ ACN (0.08%TFA)). Yield 
4.6 mg. (Analogous chemistry, ref. Wilbur, D. S., Bidconj. Chem., v8 
PP572-584) 

8. 1,2,10,1 1-octadeutero-3,6,9-trioxa-1 ,1 1-undecartedinltrile (8) 1g 
(9.4mmol) of diethylene glycol was dissolved in 2 mL P20 in a 1 0 mL 
round bottom flask under argon atmosphere. The D20 was removed by 
rotary evaporation and the process was repeated 4 times. The d-2 
diethylene glycol was additioned with 25 mL dry benzene, followed by • 
1 .6 g (28,2mmpl) d-S acrylonitrile with stinihg under argon atmosphere. 
After 1 2h, the solvent was removed under reduced pressure and the 
resultant semisolid was extracted with chlorofonri (2x5 mL). The 
solvent was removed by rotary evaporation to yield 1.85g (89%) 
product. (Ref. Ashikaga, K., Bull. Chem. Sdc. Jpn., v61 pp2443-2450) 

9. 2,3,1 1,1 2-octadeutero-4,7,10-trioxa-1 ■13-tridecanediamihe (9) Raney 
nickel (Aldrich) was washed five times with anhydrous methanol by 
inversion and decantation. 50mg of the washed catalyst was placed in 
20 mL anhydrous methanol, followed by 1g (4.6mmol) 8 in a 50 rhL 
screw-cap vial fitted with a Teflon-lined rubber septum. The vial 
headspace was flushed for a few min with H2 gas via an 18-gaUge 
needle piercing the septum. The cap was screwed on tightly and the 
entire assembly was charged to 40psl H2 and placed in a hot water 
bath (8Q;;C) for 4 hours, after which the solid catalyst was removed by 
filtration and the methanol evaporated. The final product was purified 

by reverse-phase HPLC (Vydac C-18 prep^scale Column, 6 mL/min. 
Mobile phase: H20 (0.08 / TFA)/ ACN (0.08%TFA)). Yield 1 80 mg: 
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(Ref. Ashikaga, K;/ Bull. Chem. Soc. Jpn., v61 PP2443-2450) 

10. Deuterated analog of 6 (10) 25mg (0.09mmol) 5 was added to a stirred 
solution of 90mg (0.4mmol) 9 In 5 mL 0.2M sodium carbonate. pH 1 6.5 

at 37?C. The reaction was allowed to proceed for 3 days, after which 

the solution was neutralized with dilute trifliibroacetic acid and purified 
by reverse-phase HPLC (Vydac C-18 prep-scale column, 6 mL/min. 
Mobile phase: H20 (0.08%TFA)/ ACN (0.08%TFA)). Yield 6 mg. 

1 1. Deuterated analog of 7 (1 1) A 3mg (8.4mcmol) quantity of 4 was 
dissolved in 0.7 mL anhydrous DMF with stining. under drgbn 
atmosphere. 5 mcL triethyla mine was added, followed by 2.4mg 

(8.9mcmol) 1 . The fonriation of active ester was monitored by silica 
TLC (5:1 CHCI3/CH30H, Rf 0.5. UV) by briefly drying the spotted TLC 
plate with a stream of air. After 25 minutes, the mixture was added to 

6mg (11 mcmol) 10 in 1 mL anhydrous DMF. After 2 hours, the solvent 
was removed by vacuum centrifugation and the final product was 
purified by reverse-phase HPLC (Vydac C-18 prep-scale column. 6 

mL/min, Mobile phase: H2O (0.08%TFA)/ ACN (0.08%TFA)). Yield 1.8 

Tig- 

12. GM1 internal standard conjugate (12) 1.8mg 1 1 was added to 2 mL 
lOOmM Tris/IOmM MgCI2, pH 7.3 buffer with stining. 15units 

recombinant ?-D-galactosidase (Sigma) was added, and after 12 hours 
the mixure was purified by reverse-phase HPLC (Vydac C-18 prep- 
scale column, 6 mL/min. Mobile phase: H20 (0.08%TFA)/ACN 
(0.08%TFA)). Yield 1.5 mg. 

Pdlyether diamine linker synthesis (Second Generation) 

Synthesis is based on chemistry previously described (Ketaky. R. et. al. J CHEM 
SOC PERK T 2 (2) 321-327 FEB 1990). with minor modifications and an additional two 
steps. As an example, deviations from the established procedure as well as exact details 
for the additional steps are outlined below for the starting material diethylene glycol. 
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1.1 1-Dicyano-3,6,9-trioxaundecane (13) To a stirred solution of 2% (w/v) sodium 
hydroxide (5 mL) and diethylene glycol (5.3g, Sbmrtiol) was added acrylonitrile (7.95g, 
ISOrnmol). The mixture was stirred at room temperature overnight and additloned with 
50 mL dichloromethane. The organic layer was washed 2x with brine and dried (MgS04). 
The solvent was removed by rotary evaporation. The oily residue was treated with 200 
proof ethanol. and the solvent was removed by rotary evaporation. This was repeated 2x 
to remove excess unreacted acrylonitrile. The product was used without further 
purification 

Diethyl 4.7. 1 0-trioxatridecane-1 . 1 3-dioate (14) 2g (9.4mm6l) 1 3. was dissolved 
in 5 mL ethanol. 1g cone, sulfuric acid was added slowly, over 5 minutes. The reaction 

was heated to reflux overnight. The reaction w^s extracted with 40 mL methylene 
chloride, washed once with 10 mL water and 3x with 10 mL dilute brine solution. The 
organic layer was dried (MgS04) and solvent was removed to yield an oil. The final 
product was purified by silica chromatography (methylene chloride/ethyl acetate). 

1.1 3-dihydroxy-4,7. 1 0-trioxatridecane (1 5) Prepared exactly as described, using 
tetrahydrofiiran as solvent. (i.7g. 5.5mmol 14. 50 mL distilled [CaH2] THF. 0.66g. 
16.5mmol lithium aluminum hydride). Once addition Was complete, excess LAH was 
quenched with ethanol. and the salts precipitated by dropwise addition of saturated 
sodium sulfate solution until a white precipitate formed. The solvent was removed, the 
precipitate washed 6x 30 mL with THF and the combined organic extracts were 
evaporated to yield an oil. Final product is purified by silica chromatography (first with 
niethylene chloride then with ethyl acetate and finally with acetone). 

1 ,13-dichloro-4.7.1 0-trioxatridecane (analog using P2) (16) l.lg. (4.9mmol) 15 
was added to 1.15g (14.6mmol) distilled pyridine in 30 mL dry benzene with stirring, 
followed by 1.8g (14.6mmol) thionyl chloride. The mixture was heated to reflux for 6 
hours. After cooling In an ice bath. 5 mL 3M HGI was added with vigorous stining. the 
organic layer was separated, washed 3x with a dilute brine solution, and dried (NaS04) to 
yield a yellowish oil. After washing and removal of solvent, the dichloride was used 
without further purification. 
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Additional Steps 

1 J 3-dicyano-4,7,1 0-trioxatridecane) (1 7) T6 a Stirred soluto^ 
(l5.5mmol) sodium cyanide in 4 mL dimethyl sulfoxide at 8d*C was added 1g 

(3.9mmol) of 16. After 2 hours, the reaction was additioned with 10 mL df satura^^ 
sodium chloride solution. 5 mL of water, and 50 mL ethyl acetate. The organic layer was 
washed 3x with a brine solution as before, after which the organic layer was dned 
{Na2S04) and the solvents removed. The final product was purified by silica 
chromatography (methlyene chloride/ethyl acetate). ESi-MS: predicted. 240.1 : observed, 
241.1 (M + H+)+ 

1,1 5-diamino-5,8, 1 1 -trioxapentadecane (1 8) A stirred solution of 50 mL dry 
THF containing 0,42g (I0.4mmol) fresh LAH was heated to gentib reflux Undfer argon for 
15 niihutes. 0.5g (2mmol) 17 In 15 mLdry THF wa^ added drdpwise over 20 minutes, 
maintairiing a gentle reflux. The unreacted LAH was quenched v(^ith ethandl, and the 
mixture was treated with dropwise addition of saturated sodium sulfate under efficient 
stirring until a white precipitate formed. The mixture was filtered, and the precipitate was 
washed 6x 30 mL with THF. The organic extracts were combined and the solvent was 
removed by rotary evaporation to yield an oil. ESI-MS: predicted, 248.1 : observed, 249.1 
(M + H+)+ 

Oeuteration 

Deuterium has been Incorporated into the diamine linker by reduction of 14 and 
17 using lithium aluminum deuteride (98% D) to achieve a d-8 deuterated diamine. No 
other aspects of the synthesis were changed for this procedure. These diols are used in the 
cpnstnjction of the SFD conjugates as described later: 

Clinical Substrate Synthesis for Sanfllippd Syndrome, type B (N-?-D- 
glucosamlnldase deficiency) 

13. p-Amlnopheriyl-?-D-N-acetylglucosamine (19) 20mg (0.67mmol) p- 
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Nitrophenyl-?-D-N-acetylglucosamine (Sigma) was added to 5mg 
washed palladium catalyst on activated carbon in 3 mL methanol with 
stirring in a 5 mL septa-lined vial. The septum wals pierced by a 1 6- 
gauge needle and the vial headspace was flused with H2 gas. H2 gas 
was allowed to slowly bubble through the solution for 2 hours, after 
which the catalyst was removed by filtration over diatomaceous earth 

(Celite). The solvent was removed by rotary evaporation to yield a 
semi-solid 18mg (90%). x 

14. p-AcryIamidophenyl-?-D-N-acetylglucosamine (20) lOmg (0.03mmol) 
1 9 was added to 1 5 mL methanol and 1 00 mcL triethylamine with 
stirring. The solution was chilled in an ice bath. I5mg (0.17mmol) 
acryloyi chloride was dissolved in 2 mL dry methylene chloride and 
added dropwise to the stin-ed solution over 5 minutes. The reaction 

was allowed to return to room temjDerature, followed by 2 hours of 
stirring. The solution was then treated with successive anion and 
cation exchange resins (AG MP-1 and AG MP-50. respectively, 
BioRad) until neutral pH was obtained with moist pH paper. Solvent 
was removed by rotary evaporation tb yield a solid (11 mg, 95%)! 1 H- 
NMR. Yield 11 mg. 

15. 3.6-dioxa-1,9-nonanedinitrile (21) 2g (0.032mpl) ethylene glycol was 
added to 0.5 g dry potassium hydroxide in 30 mL dry benzene, 
followed by 5g (0.096mmol) acrylonitrile with stirring overnight at room 
temperature. The reaction was filtered and the solvent was removed by 
rotary evaporation to yield an oil. Final product was purified by silica 
chromatography (chloroform/methanol) to yield a coloriess oil 3.2g 
(60%). 

16. 4,7-dioxa-1 ,1 0-decanediamine (22) Raney nickel (Aldrich) was washed 
five times with anhydrous methanol by inversion and decantatlon. 
50mg of the washed catalyst was placed in 20 mL anhydrous 
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methanol, followed by 1g (6mmo() 21 in a 50 mL screw-cap vial fitted 
with a Teflon-lined rubber septum. The vial headspace was evacuated 
with H2 gas via an 16-gauge needle piercing the septum, The cap was 
screwed on tightly and the entire assembly was charged to 40psi H2 
and placed in a hot water bath (SOi C) for 4 hours, after which the solid 
catalyst was removed by filtration and the methanol evaporated. The 
final product was purified by reverse-phase HPLG (Vydac C-18 prep- 
scale column. 6 mL/min. Mobile phase: H20 (0.08%TFA)/ ACN 
(0.08%TFA)). 

17. Michael addition product of 20 and 22 (23) 5mg (0,Q15rnmol) 20 was 
added to a stin-ed solution of 13mg (O.OBmmol) 22 In 5 mL 0.2M 
sodium carbonate, pH 10.5 at 37?C. The reaction vvas allowed to 
proceed for 3 days, after vyhich the solution was neutralized With dilute 
trifluoroacetic acid and purified by reverse-phase HPLC (Vydac C-18 
prep-scale column. 6 mUmin. Mobile phase: H20 (0.08%TFA)/ AGN 
(0.08%TFA)). Yield 6 mg. 

18. SFB substrate conjugate of 4 and 23 (24) A 4mg (0.01 3mm6l) quantity 
of 4 was dissolved in 1.5 mL anhydrous DMF with stimng. under argbn 
atmosphere. 1 0 mcL dry triethylamine was added, followed by 4mg 
(0,015mmol) 1. The formation of active ester was monitored by silica 
TLC (5:1 CHCI3/CH30H, Rf 0.5, UV) by briefly drying the spotted TLC 
plate with a stream of air. After 25 minutes, the mixture was added to 
6mg (0.012mmol) 23 In 1 mL anhydrous DMF. After 2 hobrs, the 
solvent was removed by vacuum centrifugation and the final product 

was purified by reverse-phase HPLC (Vydac C-18 prep-scale column, 
6 mL/min. Mobile phase: H20 (0.08%TFA)/ ACN (0.08%TFA)). Yield 
4.2 mg. 

1 9. 1 ,9-tetradeutero-3.6-dioxa-1 ,9-nonanedinitrile (25) 0.5g (8mmol) 
ethylene glycol was added to 0.1 g dry potassium hydroxide in 20 mL 
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acetonitrile, followed by 1 .4g (24mmol) d-3 acrylonltrile with stirring 
overnight at room temperature. The reaction was filtered and the 
solvent was renrioved by rotary evaporation to yield an oil. Final 
product was purified by silica chromatography (chloroform/methanol) to 
yield a coloriess oil 0.9g (65%). 

20. 1,9-tetradeutero-3,6-dioxa-1,9-nonanediamine (26) Raney nickel 
(Aldrich) was washed five times with anhydrous methanol by inversion 
ancJ decantation. 20mg of the washed catalyst was placed In 30 mL 
anhydrous methanol, followed by 0.5g (3mrTiol) 25 in a 50 mL screw- 
cap vial fitted with a Teflon-lined rubber septum. The vial headspace 
was evacuated with H2 gas via an 18-gauge needle piercing the 
septum, The cap was screwed on tightly and the entire assembly was 
charged to 40psi H2 and placed in a hot water bath (SQj C) for 4 hours, 
after which the solid catalyst was removed by filtration and the 
riiethanol evaporated. The final product wa3 purified by reversfe-phase 
HPLC (Vydac C-18 prep-scale column, 6 mL/min. Mobile phase: H20 
(0.08%TFA)/ AGN (0.08%TFA)). 

21. Deuterated analog of 23 (27) 20nig (p.07mmol) p^acrylamidophenyl ?- 
D-galactoside was added to a stirred solution of 90mg (0.4mmol) 26 in 

5 mL 0.2M sodium carbonate, pH I 0.5 at 37?C. The reaction was 
allowed to proceed for 3 days, after which the solution was neutralized 
vyith dilute trifluorpacetic acid and purified by reverse-phase HPLG 

(Vydac C-18 prep-scale column, 6 niL/rnin. Mobile phase: H20 
(0.08%TFA)/ ACN (0.08%TFA)). Yield 2 mg. 

22. Deuterated analog of 24 (28) A 2mg (6.3mcmol) quantity of 4 was 
dissolved in 1.5 mL anhydrous DMF with stining, under argon 
atmosphere. 5 mcL triethylaminei was added, followed by 2. 1 mg 
(7.6mcmol) 1 . The fomiation of active ester was monitored by silica 
TLC (5:1 CHCI3/CH30H, Rf 0.5. UV) by briefly drying the spotted TLC 
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plate with a stream of air. After 35 minutes, the mixture was added to 
4mg (Tmcmol) 27 in 1 mL anhydrous DMF- After 2 hours, the solvent 
was removed by vacuum centrifugation and the final product was 
purified by reverse-phase HPLC (Vydac C-18 prep-scale column, 6 
mL/min. Mobile phase: H20 (0.08%TFA)/ ACN (0.08%TFA)). Yield 1 .2 
mg. 

23. SFB Internal standard conjugate (29) 1 .2mg 28 was added to 2 mL 
lOOmM Tris/10mM.MgCI2, pH 7.3 buffer with stirring. 15 units 
recombinant ?-D-galactosidase (Sigma) was added, and after 12 hours 
the rnixure was purified by reverse-phase HPLG (Vydac C-18 prep- 
scale column, 6 mLVmln. Mobile phase: H20 (0.08%TFA)/ACN 
(0.08%TFA)). Yield 0.7 mg. 

Clinical Substrate Synthesis for Sanfillppo Syndrome, type D 
(a sulfatasiB deficiency) ; 

24. p-Acrylamidophenyl-?-D-N-acetylglucosamine-6-sulfate (30) lOOmg 
(0.28mmol) 20 was added to 1 0 mL dry DMF under argon atmosphere 

with stirring at room temperature. 89iTig (0.56mm6l) sulftirtrioxide- 
pyridine complex was dissolved in 2 mL dry DMF and was added to 
the reaction in 0,7x, l.lx, 1.3x and 1 .9x equivalents (+700 mcL, +400 
mcL, +200 mcL, and +600 mcL). The reaction progress was monitored 
by 1 H-NMR shift of the anomeric (CI ) proton chfertiical shift from 5:29 

to 5.24 ppm by removal of 15 mcL of solution 1 hour after addition of 
each amount of sulfating reagent. The remdVed mixture was dried by 
vacuum centrifugation and redissolved In d-6 DMSO and analyzed. 
Upon the appearance of more than two forms (starting material and C- 
6 sulfate) of the CI anomeric proton, the reaction was removed to - 
20?C and stored. The product was purified by vacuurn centrifiigatibn to 
remove solvent, followed by reverse-phase HPLC (Vydac C-18 prep- 
scale column, 6 mL/min. Mobile phase: H20 (0.08°/ TFA)/ ACN 
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(0.08%TFA)). Yield 72%. 

25. Michael addition product of 18 and 30 (31) 25mg (0.058mmol)30 was 
added to a stin-ed solution of 83mg (0.35mmol) 16 in 5 mL 0.2M 
sodium carbonate. pH 10.5 at 37?C. The reaction was allowed to 
proceed for 3 days, after which the solution was neutralized with dilute 
trifluoroacetic acid and punfied by reverse-phase HPLC (Vydac C-1 8 
prep-scale column, 6 mL/min. Mobile phase: H20 (0.68%TFA)/ ACN 
(0.08%TFA)). Yield 10 mg. 

26. SFD substrate conjugate of 4 and 31 (32) A 5.7mg (0.01 8mmol) 
quantity of 4 was dissolved in 1.0 mL anhydrous DMF with stirring, 
under argon atmosphere. 20 mcL dry triethylamine was added, 
followed by 5,5mg (0.020mmol) 1 . The fonnatiori of active ester was 
monitored by silica TLC (5:1 CHCI3/CH30H, Rf 0.5. UV) by briefly 
drying the spotted TLC plate with a stream of air. After 25 minutes, the 
mixture was added to lOmg (0.015mmol) 31 in 1 mL anhydrous DMF. 
After 2 hours, the solvent was removed by vacuum centrifugation and 
the final product was purified by reverse-phase HPLC (Vydac C-i 8 
prep-scale column, 6 mL/min. Mobile phase: H20 (0:p8%TFA)/ ACN 
(0.08%tFA)). Yield 5.4 mg. 

27, 1,2.14,15-octadeuter6-1,15-diamino-5.8.1 1-trioxapentadecarie (33) As 
referenced in 6Polyether Diamine Linker Synthesis. Second Generation 

28. Deuterated analog of 31 (34) 25mg (0.07mmol) 20 was added to a 

stin-ed solution of lOOmg (0.4mmol) 1 1 in 5 mL 0.2M sodium 

carbonate, pH 10.5 at 37?C. Thei reaction was allowed to proceed for 3 

days, after which the solution was neutralized with dilute trifluoroacetic 

acid and purified by reverse-phase HPLC (Vydac C-1 8 prep-scale 

column, 6 mL/min. Mobile phase: H20 (p.08%TFA)/ ACN 
(0.08%TFA)). Yield7 mg. 

29. SFD internal standard conjugate (35) A 4mg (12.6mcmbl) quantity of 4 
was dissolved in 1 mL anhydrous DMF with stining, under argon 
atmosphere. 20 mcL triethylamine was added, followed by 4mg 
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(14mcmol) 1 . The formation of active ester was monitored by silica 
TLC (5:1 CHCI3/CH30H. Rf 0.5. UV) by briefly drying the spotted TLC 
plate with a stream of air. After 20 minutes, the mixture was added to 
7mg (1 1 mcmol) 34 in 1 mL anhydrous DMF. After 4 hours, the solvent 
was removed by vacuum centrlfugatlon and the final product was 
purified by reverse-phase HPLC (Vydac C-18 prep-scale column. 6 : 

mUmin. Mobile phase: H20 (0;08%TFA)/ ACN (0.08%TFA)). Yield 2. 7 
mg. 

N-(d-Biotinyl-sarcosinyl)-12-aminododecanoic acid (36). Compound 4 (32.2 mg, 0.102 
mmole) was dried oyemight in vacuo (with P205). Dry DMF (2 ml) was added and the 
mixture was stin-ed with warming to affect dissolution under nitrogen, triethylamine (34 
mcL) was added followed by 1 (20.4 mcL. 0.1 15 mmole) added In two 10.2 mcL 
portions. 5 mln apart. The mixture was stired for 1 hr at room temperature under 
nitrogen. 12-Aminododecanlc acid (24.1 mg. 0.112 mmole. Sigma) was added In one 
portion, and the mixture was stin-ed at room temperature for 2 hr under nitrogen. CHas 
(80 mL) was added, and the organiq solution was washed with two 10 mL portions of 1 
M HCI. CHCI3 was removed by rotary evaporation, and residual DMF was removed by 
vacuum centrifugation. The compound was dissolved in methanol and purified by HPLC 
(Vydac 218TP prep column). Solvent program is: 0-10 miri, water with 0.06% TFA; 10- 
55 min, 0-100% methanol with 0.06% TFA, flow rate is 6 mL/min. Yield 31.7 mg. 1 H- 
NMR. ESI-MS, calculated 513.4. observed 513.4 (M + H)+ 

N-hydroxysuccinimidyl esterof 36 (37). Compound 36 (9:8 mg. 19 mcmoie) Is dissolved 
In 100 mcL of dry DMF under nitrogen. N-hydroxysuccinlmlde (2.2 mg. 19 mcmoie) 
was added followed by dicyclohexylcarbodllmide (3.9 mg. 19 mcmoie). The mixture 
was stirred at room temperature for 60 h In the dark. Solvent was removed by vacuum 
centrifugation. and the residue was submitted to flash chromatography oh silica gel using 
a gradient of CHCI3/CH30H (15/1) to CHCI3/CH30H (12/1). Yield 9.8 mg. 1H-NMR. 
ESI-MS. calculated 610.8. observed 609.7 (M + H)+ 

N-(N-(d-Biotinyl-sarcosinyl)-12-aminododecanoyl)-pyschosihe (38). Compound 37 (6.2 
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mg, 1 0 mcmole) and pyschosine (4.7 mg, 1 0 mcmole. Sigma) were dissolved in 200 
mcL of dry DI\^F under nitrogen. Diisoproylethylamine (5 mcL) was added, and the 
mixture was stirred undfer nitrogen for 2 days in the dark. The compound was Injected 
directly onto the HPLC column (Vydac 21 8TP semi-prep), and the column was 
developed at 2 mL/min with 0-20 min. water with 0.06% TFA. then 20-80 min. 0-iOO% 
methanol with 0.06% TFA. Yield 3.8 mg. 1H-NMR. ESI-MS, calculated 957.3. observed 
956.8 (M + H)+ 

N-(N-(d-Blotinyl-sarcosinyl)-12-aminododecanbyl)-sphingosylphosphorylcholme(39). 
Sphfngosylphosphorylcholine (4.0 mg. Sigma) was mixed with 1 mL dry DMF and 
solvent was removed by vacuum centrifugation. This was repeated two more times. The 
final dried residue weighed 2.5 mg (5.4 mcmole). To this residue was added 3.3 mg of 
37 (5.4 mcmole). 150 mcL of dry DMF. and 2.5 mcL of diisoproylethylamine. The 
mixture was stin-ed under nitrogen in the dari< for 3 days. The compound was injected 
directly onto the HPLC column (Vydac 21 8TP semi-prep), and the column was 
developed at 2 mUmin with 0-20 min, water with 0.06% TFA. then 20-80 min, 0-100% 
methanol with 0.06% TFA. Yield 3.8 mg. 1 H-NMR. ESI-MS. calculated 960.3. observed 
958.7 (M + H)+ 



Conjugate of d-biotin With 1.13-diamino-4.7.10-trioxatridecane (40). Compound 2 was 
.reacted with 1.13-dlamino-4.7.10-trloxatridecane (Fluka) essentially as described for the 
synthesis of 3. The product was purified by HPLC (Vydac 218TP. semi-prep) using 0- 
100% methanol with 0.06% TFA over 30 min at 1 .5 mUmin. 

lodoacetylated 40 (41 ). Compound 40 was treated with 5 equivalents of iodoacetic 
anhydride (Aldrich) in dry DMF with stirring under nitrogen for 4 h at room temperature; 
The product was purified on HPLC as for 40. The structure was confirmed by ESI-MS. 

Octadeuterated 41 (42). The title compound was prepared as for the 40 using 9 instead 
of 1 .13-diamino-4.7.10-trioxatridecane. 
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Octadeuterated 42 (43). The title compound was prepared from 42 as for 41. the 
structure was confirmed by ESI-MS. 
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Exemplary MS" Techniques and Instrumentation 

An automated LC-MS/MS system for the identification of proteins by their amino add 
sequence has been developed. A schematic representation Is shown In Fig. 7 The system, 
which consists of an autosample. a capillary HPLC system connected on-line to ^n ESI triple 
quadrupole MS/MS Instmment and a data system is operated in the following wdy: Proteins 
(tyjDically separated by 1D or 2D gel electrophoresis) are cleaved with a specific protease, 
usually trypsin, the resulting cleavage fragments are placed in an autosampler. Every 37 
minutes the autosampler Injects one sample into the HPLC system and the peptides are 
separated by capillary reverse-phase chromatography. As separated peptides elute from the 
chromatography column, they are ionized by the ESI process, enter the MS and the mass to 
charge ratio (m/z) is measured. Any peptide ion whose intensity exceeds a predetemiined 
intensity threshold is automatically selected by the instrument and collided In the collision cell 
with inert gas. These collisions result in peptide fragmentation, primarily at the bonds of the 
peptide backbone (collision induced dissociation. CID). The masses of the CID fragments 
are measured and recorded in the data system. The CID spectmm of a peptide cbntaihs 
sufficient information to identify the protein by searching sequence databases with the 
uninterpreted MS/MS spectra. This is accomplished with the Sequent program, the 
program identifies each peptide in a sequence database which has the same mass as the 
peptide that was selected In the MS for CID and predicts the MS/MS spectmm for each one 
of the isobaric peptides. By matching the experimentally detennined CID spectrum with 
computer generated theoretical CID spectra, the protein from which the observed peptide 
originated is identified. The system is capable of analyzing protein samples In a fully 
automated fashion at a pace of less than 40 min. per sample. Since each peptide represents 
an independent protein identification and usually multiple peptides are derived from one 
protein, protein identification by this method is redundant and tolerant to proteins co- 
migrating in a gel. The system is well suited for the detection and characterization of 
modified residues within polypeptide chains. The LC-MS/MS technique and automated 
analysis of the generated CID spectra can be used for the methods of this Invention. 

aomrrS* llclrlnh ^ sensitivity by solid-phase extraction 
capillary lectrophoresis tandem mass spectrometry (SPE-CE-MS/MS) 
Protein identification by this method is based on the same principle as described 

above, except that peptide separation and ionization are perfomied at significantly higher 
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sensitivity. Fig.8 shows a schematic representation of the key design elements. The design 
of the system and its mode of operation have been published: Peptides derived from protein 

digests are concentrated by SPE. separated by CE and analyzed by ESI-MS/MS. The 
resulting uninterpreted CID spectra are used to search sequence databases with the 
Sequest software system. The SPE extraction device is a small reversed-phase 
chromatography column of the dimensions 0.18 x 1 mm which is directly pack^ in a fijsed 
silica separation capillary. Peptides contained in a sample solution are adsorbed and 
concentrated on the SPE device, eluted in anestimated 100- 300 nl of organic solvent and 
further concentrated by electrophoretic stacking and/or isotachophdresis to an estimated 
volume of 5-30 nl. The peptides are then separated by GE in a 20 pm or 50 pm i.d capillary 
and directly ionized by ESI as they leave the capillary (see reference 13 for design of the 
microspray ionization source). With this system, peptide masses can be detemiined at a 
sensitivity of 660 attomoles (approx. 500 fg for a 20 residue peptide) at a concentration limit 
of 33 amol/Ml and that proteins can be Identified by the CID spectra of automatically selected 
peptides at less than 10 fmol (0.5 ng for a protein of 50 kDa) of sample at a concentration 
limit of less than 300 amol/pl. this technique is used for the analysis at very high sensitivity 
of the peptide samples generated by the experiments. It has also been demonstrated that 
the analysis time available for automated CID experiments can be significantly extended by 

data-dependent modulation of the CE voltage. If several peptide ions are detected 
coincidentally in the MS. the CE voltage is automatically dropped. This results in a reduction 
of the electroosmotlcflow outof the capillary and therefore in an extension of the time period 
available for selecting peptide ions for CID. The net effect of this peak paricing t^chnlqu^ Is 
an extension of the dynamic range of the technique because the increased time available is 
used for CID of ions with a low ion current Once all the peptide ions are analyzed 
electrophoresis is automatically reaceelerated by increasing the CE voltage to the original 
valu . 
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TABLE 1 . Relative, redundant quantitation of a-lactalbunnin abundance (after mixing 
with known amount of tlie same protein with cysteines modified with isotopically 
heavy biotinylating reagent) 



Pentid 
e# 


(light) 


e state 


r^cpiiu6 oequence 


Ratio 
(heavyilight) 


1 


518.4 


2+ 


(K) IWCK 


2.70 


2 


568.4 


2+ 


YK^ALCSEK 




3 


570.4 


2+ 


(K)CEVFR 


2.90 


4 


760.5 


2+ 


(K) LDQWLCEK 


2.82 


5 


710.1 


3+ 


(K) FLDDDLTDDIMCVK 


2.88 


6 


954.2 


3+ 


(K) DDQNPHSSNICNISGDK 


2.90 


7 


1286. 
9 


4+ 


(K) GYGGVSLPEWVCTTFHTSGYDT 
QAIVQNNDSTEYGLFQINNK 


NA' 



^ Isotope ratio was not analyzed because on a 4* peptidie the isotope patterns were 
highly overlapping due to differences of only 2 amu between heavy and light ibns.i 
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TABLE 2. Sequence identification and quantitation of the components of a protein 
mixture in a single analysis. 



Gene Name* Peptide sequence identified Observed Mean±SD Expected % 



ratio 
(d0/d8)* 



ratio error 
(dO/dSr 



LCA BOVIN 



ALC#SEK 
C#EVFR 

FLDDLTDDIIVIC#VK 



0.94 
1.03 
0.92 



0.96±0.06 



1.00 



4.2 



OVAL CHICK 



ADHPFLFC#IK 
YPILPEYLQC#VK 



1.88 
1.96 



1.92±o:06 



2.00 



4.0 



BGAL ECOLI 



LTAAC#FDR 

IGLNC#QLAQVAER 

IIFDGVNSAFHLWC#NGR 



1.00 
0.91 
1.04 



0.98±0.07 



1.00 



2.0 



LACB BOVIN 



WENGEe#AQK 
LSFNPTQLEEQC#HI 



3.64 
3.45 



3.55±0.13 



4.00 



11.3 



G3P_RABIT VPTPNVSWDLTC#R 0 54 

■ . IVSNASC#TTNC#LAPLAK 0.S7 



0.56±0.02 



PHS2 RABIT 



0.32±0.03 



0.50 



0.33 



12.0 



IC#GGWQMEEADDWLR 0 32 

TC#AYTNHTVLPEALER 0.35 
- WLVLC#NPGLAEIIAER 0.30 

* Gene names are according to Swiss Prot nomenclature (www.exoasv ch) 

Ratios were calculated for each peptide as shown in Fig. 3. 

= Bcpected ratios were calculated from the known amounts of proteins present in each mixture 

# ICAT-labeled cysteinyl residue. 



3.1 
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TABLE 3. Protein profiles from yeast growing on galactose or ethanol as a carbon source. 



Gene 
name* 


Peptjde sequence identified 


Observed 
ratio* 

\E— u l.Oal ) 


Galactose- 
repressed^ 


1 

repressed^ 


ACH1 


KHNC#LHEPHMLK 


>inn * 1 


✓ . 




. ADH1 


YSGVC#HTDLHAWHGDWPLPVK 
C#C#SDVFNQWK 


0.57:1 
0.48: 1; 






ADH2 


YSGVC#HTbLHAWHGbWPLPTK 
C#SSDVFNHWK 


>200 : 1 
>200 : 1 






ALD4 


TFEVINPSTEEEIC#HIYEGR 


>100:1 


, V.':. 


y 


BMH1 


SEHQVELIC#SYR 


0.95 : 1 






CDCi9 


YRPNC#PIILVTR 

NC#TPKPTSTTETVAASAVAAVFEQK 
AC#DDK 


0.49 : 1 
0.65 : 1 

U.O/ . J . 






FBA1 


SIAPAYGIPWLHSDHC#AK 
EQVGC#K 


u.ou : 1 
0.63 : 1 






GAL1 


LTGAGWGGC#TVHLVPGGPNGNIEK 


1:>200 






GAL10 


HH!PFYEVDLC#DR 
DC#VTLK 


1 : >200 
1 : >200 






GCY1 


LWC#TQHHEPEVALDQSLK 


0.34:1 






GLK1 


IC#SVNLHGDHTFSMEQMK 


0.65:1 






GPD1 


IC#SQLK 


0.54:1 






ICL1 


GGTQC#SIMR 


>100: 1 






IPP1 


NC#FPHHGYIHNYGAFPQTWEnPNVCi- 
HPETK 


■ U.fD . 1 






LPD1 


VC#HAHPTLSEAFK 


1.30 : 1 






PEP4 


KGWTGQYTLDC#NTR 


2.60 : 1 






PSA1 


SWLC#NSTIK 


0.56 :. 1 






PGM2 


C#TGGIILtASHNPGGPENDMGIK 
LSIC#GEESFGTGSNHVR 


0.58 : 1 
0.62 : 1 






PCK1 


C#PLK 

IPC#LADSHPK 

C#INLSAEKEPEIFDAIK 

C#AYPIDY!PSAK 

IVEEPTSKDEIWWGPVNKPC#SER 


1.59: 1 
1.47 : 1 
1.52 : 1 
1.41 : 1 
1.85: 1 






QCR6 


ALVHHYEEC#AER 


1.30:1 






RPLIAif 


SC#GVDAMSVDDLKK 


0.82 : 1 






SAH1 


HPEMLEDC#FGLSEETTTGVHHLYR 
EC#INIKPQVDR 


0.62 : 1 
0.74 : 1 
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SOD1 



QFHIHEFGDATNGG#VSAGPHFNPFK 



0.46 : 1 



TEF1 RGNVG#GDAK 
C#GGiDK 

FVPSKPMC#VEAFSEYPPLGR 



0.81 : 1 
0.70 : 1 
0.74 : 1 



VMA2 IPIFSASGLPHNEIAAQIC#R 



0.70 : 1 



YHB1 HYSLC#SASTK 



0.69 : 1 



* Gene names are according to the Yeast Proteome Database (YPD) (19) — ^ 

# Cysteinyl residue is ICAT-labeled. ' / v 
I Protein expression ratios were calculated as described in Fig 3 

- Carbon source for yeast growth was 2% ethanol (Eth) or 2% gaiactose (GALO 
^ Gene is known to be galactose- or glucose-repressed (1 9) 
Eight other ribosomal proteins were detected at similar gene expression levels 
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TABLE - 4 

Enzyme [ Dysfunction~ 



Butvrvlcholine^tpr^i^^ Hpfinionr*\/ 


bunt 


Decreased or absent 
enzyme activity 


fructokinase deficiency 


Fructokinase 


Deficient enzyhie activity 


Hereditary fructose intolerance 


Fructose 1,6- 
bisphosphate aldolase B 


Deficient enzyme activity 


Hereditary fructose 1,6- 
diphosphatase deficiency 


Fructose 1 ,6- 
bisphosphatase 


Deficient enzyme activity 


Erythrocyte aldolase deficiency 
with nonspherocytic hemolytic 
anemia (^aiaoiase A deiiciency) 


Fructose 1 ,6- 
bisphosphate aldolase A 


Deficient enzyme activity 


Glycogen storage disease type 
la (yon oierKe disease) 


Glucose 6-phosphatase 


Absent or deficient 
eniyme activity 


Glycogen storage disease typie 
lb 


Glucose 6-phosphate 
translocase 


Deficient transport of 
glucose 6-phosphate 
across the membrane of 
endoplasmic reticulum 


Glycogen storage disease type 

III 
III 


Amylo-1 . 6-glucosidase 
(debrancher enzyme) 


Absent or deficient 
enzyme activity 


Glycogen storage disease type 

IV. \^AAI lUCl opi 1 UloCOSe^ 


a-1, 4 glucan-6-a- 
glucosyltransferase 


Deficient enzyme activity 


Glycogen storage disease type 

V ^iviu/Al Ulc Ulocaoc } 


Muscle glycogen 
phosphorylase 


Absent or deficient 
enzyme 


Glycogen storage disease X- 
linked phosphprylase kinase 
deficiency 


Phosphorylase i)-kinase 


Deficient or absent 

on7\/r)no fitn/^ti/\n 

csfi^cyiiic; ciuiiviiy lunciion 


Glycogen storage disease 
autosomal phosphorylase 
kinase deficiency 


Phosphorylase b-kinase 


Deficient enzyme activity 


Glycogen storage disease liver 
phospihorylase deficiency 


Liver phosphorylase 


Deficient enzyme activity 


Glycogen storage disease type 
Vjl (Tarui disease) 


Muscle 
phosphofructokinase 1 


Deficient enzyme activity 


Liver glycogen synthase 
deficiency 


*• 

Liver glycogen synthase 


Unknown 
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Phosphoglycerate kinase 

-UdlwIdlLry 


Phosphoglycerate kinase 


Deficient enzyme 


Phosphoglycerate mutase 

u CI 1 Lri c 1 1 oy 


Phosphoglycerate 
mutase 


Deficient enzyme 


Muscle lactate dehydrogenase 
deficiency 


Muscle-specific subunit of 
lactate dehydrogenase 
(LDH) 


Absence of M subunit of 
LDH. Muscle LDH is a 
tetramer of the heart- 
specific subunit 


Glucose phosphate isomerase 
deficiency 


Glucose phosphate 
ispmeraise 


Unknown 


Transferase deficiency 
galactosemia 


Galactose 1 -phosphate 
uridy transferase 


Deficient enzyme activity 


Galactokinase deficiency 
galactosemia 


Galactokinase 


Deficient enzyme activity 


Epimerase deficiency 
galactosemia 


Uridine diphosphate 
galactose-4-epimerase 


Deficient enzyme action 

in blood cells only 
(benign) or, more rarely, 
in all tissues (generalized) 


Phenylketonuria (PKU) due to 
r Mn aeTiciency 


Phenylalanine 
hydroxylase (PAH) 


Deficient or absent PAH 
activity (<1% normal) 


Hyperphenylalaninemia due to 
DHPR-deficiency 


Dihydropteridine 
reductase (DHPR) 


Deficient or absent DHPR 
activity 


Hyperphenylalaninemia due to 
GTP-CH-deficiehcy 


Guanosine triphosphate 
cyciohydrolase (GTP-CH) 


Deficifent enzyme activity 


Hyperphenylalaninemia due to 
6-PTS-deficiency 


6-Pyruvoyl 
tetrahydropterin synthase 
(6-PTS) 


Deficient enzyme activity 


Oculocutaneous tyrosinemia 
(tyrosinemia type II; tyrosine 
amino-transferase deficiency) 


Tyrosine 
aminotransferase 


Decreased activity 


4-HydroxyphenylpyrUvic acid 
dioxygenase (tyrosinemia type 
III) 


4-Hydroxy-phenylpyruvic 
acid dioxygenase 


Decreased activity 


Maleylacetoacetate isomerase 
deficiency (tyrosinemia type lb) 
(tentative) 


Maleylacetonacetate 
isomerase 


Pr sumably decreased 
enzyme activity 
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Hepatorenal tyrosinemia 
(tyrosinemia type 1: 
fumarylacetoacetate hydrolase 
deficiency) 


Fumarylacetoacetate 
hydroxylase 


Deficient enzyme activity 


Carbamyl phosphate synthetase 
deficiency 


Carbamyl phosphate 
synthetase! 


Absent or deficient 
enzyme activity 


Ornithine transcarbamylase 
deficiency 


Omithine 
transcarbamylase 


At)5ent or reduced 
enzyme activity 


Argininosuccinic acid 
synthetase deficiency 


Argininosuccinic acid 
synthetase 


Deficient enzyme activity 


ArgininosucQinase deficiency 


Argininosuccinate lyase 


Deficient enzyme activity 


Arginase deficiency 


Liver arginase 


Deficient enzynne activity 


Familial hyperlysinemia (variant; 
s;accharopinuria) 


a-Aminoadipic 
semialdehyde synthase 


Deficient enzyme activity 


Maple syrup urine disease 
(MSUD) or branched chain 
ketoacidemia 


Branched-chain a-keto 
acid dehydrogenase 


Deficient or absent (<2%) 
BCKAD complex activity 

in mitochondria; 
immunologic absence or 

subunits; impairment of 
El subunit assembly 


Cystathionine P-synthase 
dsficipnpv 


Cystathionihe P-synthase 


Deficient enzyme activity 


a-Cystathionase deficiency 


a-Cystathionase 


Deficient enzyme activity 


Hepatic methionine 
adendsyltransferase deficiency 


Isoenzyme of methionine 
adenosyltransferase 


Deficient enzyme activity 


Sarcosinemia 


Sarcosine 
dehydrogenase? 


Deficient enzynrie activity 


Nonketotic hyperglycinemia 


Glycine cleavage system 


Deficient enzyme activity 


Hyperuracil thyminuria 


Dihydropyrimidine 
dehydrogenase 


Deficient enzyme activity 


Dihydropyrimidinuria 


Dihydropyrimidinase 


Unknown 


Pyridoxine dependency With 
seizures 


Brain glutamic acid 
decarboxylase-1 


Deficient coenzyme 
binding? (brain) 
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v3A\D/\ arninoiransTerss© 
deficiency 


GABA-a-ketoglutarate 
transaminase 


Deficient enzyme activity 


**-nyaroxyDUiync aciound 


Succinic semialdehyde 
dehydrogenase 


Deficient enzyme activity 


Serum carnbsinase deficiency 
and homocamosiriosis 


Serum camosinase 


Deficient enzyme 


Alkaptonuria 


Homogentisic acid 
oxidase 


Absent or deficient 
enzvme actlvitv 


Isovaleric acidemia 


Isovaleryl-CoA 
dehydrogenase 


Deficient enzyme activity, 
deficient protein, 
abnormal Debtidp* ^i^^ 


Isolated 3-methylcrotonyl-CoA 
carboxylase deficiency 


3-Methylcrotonyl-CoA 
carboxylase 


Deficient enzyme activity 


3"Methylglutaconic aciduria 
Mild form: 


3-Methylglutaconyl-CoA 
hydratase 


Deficient enzyme activity 


3-Hydroxy-3-methylglutaryl-CoA 
lyase deficiency 


3-Hydroxy-3- 
methylglutaryl-GoA lyase 


Deficient enzyme activity 


Mevalonic aciduria 


Mevalonate kinase 


Deficient enzyme activity 


Mitochondrial acetoacetyl-CoA 
thioiase deficiency 


Mitochondrial 
acetoacetyl-CoA thioiase 
(T2) 


Deficient enzvme activitv 
decreased protein, 
unstable protein 


Propionic acidemia (2 ripnallelic 
forms designated pccA and 
pccBC) 


Propionyl-CoA 
carboxylase (PCC) 


Deficient enzyme activity 
(nonallelic forms reflect 
mutations in nonideritical 
subunits of PCC^ 


Methylmalonic acidemia (2 
allelic variants designated maf** 
and muf 


Methyldmalonyl-CoA 
mutase (MUT) 
apoenzyme 


Absent MUT activity in 
muf^ deficient MUT 
activity due to reduced 

affinity for cofactor 
(adenosylcobaliamin) in 
muf 


Glufaric acidemia type 1 


Glutaryl-GoA 
dehydrogenase 


Deficient enzyme activity 


Cytochrome oxidase deficiency 


Cytochrome oxidase 
pblypeptides 


Decreased activity of the 
cytochrome oxidase 
complex 
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Dysfunction 




Pyruvate dehydrogenase 

complex deficiency-Ev 
decarboxyfase component 


Pyruvate decarboxylase. 


Decreased enzyme 
activity, decreased 
protein 


Pyruvate dehydrpgenease Ej 
transacylase 


r^ i H \ /W r r\ 1 1 r\ r\ o rvi i a4 A 

L-'ii jyui uiipoamiue 

transacylase 


Decreased enzyrrie 
activity; abnormal protein 
electrophoretic mobility 


Combined a-ketoacid 

dehydrogenase 
deficiency/lipoamide 
dehydrogenase deficiency 


Lipoamide 
dehydrogenase 


Decreased enzyme 
activity 


Pyruvate carboxylase deficiency 


Pyruvate carboxylase 


Absent enzyme activity; 7 
cases absent enzyme, 
protein, and mRNA . 


Carnitine palmitoyi transferase 1 
(GPT 1) deficiency 


Carnitine palmitoyi 
transferase 1 


Deficient enzyme ; 


Camitine/acylcamitine 
transiocase deficiency 


Carnitine/ iacylcamitine 
transiocase 


Deficient transiocase 


Carnitine palmitoyi transferase II 
^Ori II; aeticiency 


Camitine palmitoyi 
transferase 11 


Deficient enzyme 


Very long-chain acyl-CoA , 
aenyarogenase ( VLUAU) 
deficiency 


Very long-chain acyl-CoA 
dehydrogenase 


Deficient enzyme 


Long-chain acyl-CoA 
uci lyuruyenase 


Long-chain acyl-CoA 
dehydrogenase 


Deficient enzyme 


Long-chain L-3-hydroxyacyl- 
CoA dehydrogenase (LCHAD) 
aeiiciency 


L-3-hydroxyacyl-CGA 
dehydrogenase 


Deficient enzyme 


Trifunctional enzyme (TFE) 
deficiency 


Trifunctional enzyme 


Deficient enzyme 


DiehdIylrCo reductase 
deficiency 


2,4-dienoyl-CoA 
reductase 


Deficient enzynie 


Medium-chain acyl-CoA 
dehydrogenase (MCAD 
deficiency 


Medium-chain acyl-CoA 
dehydrogenase 


Deficient enzyme 


Short-chain acyl-CoA 
dehydrogenase (SCAD) 
deficiency 


Short-chain acyl-CoA 
dehydrogenase 


Deficient enzyme 
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Glutaric acidemia type II 


Electron transfer 
Tiavoprotein (ETF); 
ETF:ubiquinohis 
oxidoreductase 


In some cases, no 
enzyme antigen; in 
pthehs, no enzyme 
activity 


Glycerol kinase deficiency (Gkd) 


Glycerol kinase 


Thet microdeletion 
involves not only GK but 
also the other deleted 
loci: AHC, DMD, OTC, 
and other linked loci 


Primary gout: superactive 

Varioni OT 

phosphoribosylpyrophosphate 
(PP-ribose-P) synthetase 


PP-ribose-P synthetasie 


Enhanced enzymei 
activity 


Primary gout: partial deficiency 
OT nypoxaninine guanine 
phosphoribosyltransferase 
(HPRT) 


Hypoxanthine guanine 
phbsphoribosyl 
transferase (HPRT) 


Absent or deficient 
enzyme activity 


Ltssun-iMynan synarome; 
deficiency of hypoxanthine 

guanine 
phosphoribpsyltransferase 

(HPRT) 


Hypoxanthine guanine 
phosphoribosyltrans- 
ferase (HPRT) 


Absent or deficient 
enzyme activity 


2.8-Dihydroxyadenine lithiasis 
(adenine 
phosphoribosyltransferase 
deficiency) 


Adenine 
phosphpribosyltransferas 

. - e 


Type 1: absent enzyme 
activity; type II: reducted 
affinity for PP-ribose-P 


Adenosine deaminase 
deficiency with severe combined 
immunodeficiency disease 


Adenosine deaminase 


Absent or greatly 
diminished enzyme 
activity 


Purine nucleoside 
phosphoryiase deficiency with 
cellular immunodeficiency 


Purine nucleoside 
phosphoryiase 


Absent or greatly 
dimiimished enzyme 
activity 


Myoadenylate deaminase 
deficiency 


Myoadenylate deaminase 
(AMPDI) 


No enzyme activity; on 
immunoreactive protein 


Xanthinuria 


Xanthine dehydrogenase 
(xanthine oxidase) 


Type 1: absent xanthine 
dehydrogenase activity; 
type ll:.absent xanthih 
dehydrogenase and 
aldehyd oxidase activity 
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Hereditary orotic aciduria 


UMP synthase 


Deficient enzyme activity 
(unstable protein) 


Pyrimidine 5 -nucleotidase 
deficiency 


Pyrimidine 5 - 
nucleotidase 


Absent or unstable 
enzyme 


Dihydropyrimidine 
dehydrogenase deficiency 


Dihydropyrimidine 
dehydrogenase 


Absent or unstable 
enzyme 


Dihydropryimidase deficiency 


Dlhydropyrimidase 


Absent or unstable 
en2^me 


Familial lipoprotein lipase 
deficiency 


Lipoprotein lipase 


rslonfunctional protein in 
some, nbndectable 
enzyme activity and 
piuiciii in oiners 


Farhilial lecithin: chloesterol 
acyltransferase deficiency 


Leclthinrcholesterol 
acyltransferase 


Absent enzyme protein or 
deficient en^vhnp ?irfi\/iK/ 


5-Aminolevulinic acid 
dehydratase porphyria 


6-Aminolevulinic acid 
dehydratase 


Minimal enzyme activity 


Acute intermittent porphyria 


Porphobilinogen 
deaminase 


Decreased enzyme 
activitv ('^50%) 


Congenital erythropoietic 
porphyria 


Uroporphyrinogen III 
cosynthase 


Minimal enzyme activity 


Porphyria cutanea tarda (familial 
form) 


Uroporphyrinogen 
decarboxylase 


Decreased enzyme 
activity (-50%) 


Hepatoerythropoietic porphyria 


Uroporphyrinogen 
debarbbxylase 


Mlninnial enzyme activity 


Hereditary coproporphyria 


Coproporphyrinogen 
oxidase 


Decreased enzyme 
activity (-50%) 


Variegate porphyria 


Prbtoporphyrinogen 
oxidase 


Decreased enzyme 
activity (-50%) 


Erythropoietic protophorphyria 


Ferrochelatase 


Decrease enzyme activity 
(-50%) 


Crigler-Najjar syndrome, type 1 


Bilirubin UDP- 
glucurpnosyltransferase 


Absent enzyme activity 


Crigler-Najjar syndrome, type II 


Bilirubin UDP- 
glucuronosyltransferase. 


Markedly reduced 
enzyme activity 
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Gilbert syndrome 


Bilirubin UDP- 
glucuronosyltransferase 
activity 


Reduced enzyme activity 


Refsum disease 


Phytanic acid a- 
hydroxylase 


Deficient enzyme activity 


Primary hyperoxaluria type 1 


Alanine-glyoxylate 
aminotransferase 


Loss of enzyme catalytic 
activity and aberrant 
subcellular distribution 


Primary hvoeroxaluria tvop o 


oiyoxyiate reductase/D- 
glycerate dehydrogenase 


Loss of enzyme catalytic 
activity 


Gk««* QariQliosirin^i^* 
hexosaminadase a-subunit 
deficiency (variant B, Tay-Sachs 
disease) 


3-hexosaminidase 


Absent or defective 
hexosamininidase A (ap) 
activity 


Glycogen storage diseases type 
II 


a-glucosidase 


Absent or deficient 
enzyme activity 


Mucopolysaccharidosis 1 
(Hurler, Scheie, and Hurler- 
Sheie syndromes. MPS, MPS 
IS, MPSIH/S) 


a-L-iduronidase 


Absent enzyme activity 


Mucopolysaccharidosis II 
(Hunter syndrome) 


Iduronate sulfatase 


ArlQOnt on7\/mA o/^i-K/il^* 

/^uaciii di^yme acuviiy 


Mucopolysaccharidosis III 
(Sanifiippo syndrome) types A, 
B, C and D 


IIIA: Heparan A/-sulfatase 
IIIB: d-A/-acetyl- 
giucosaminidase 
MIC: Acetyl-CoA: a- 

glueosaminide 
acetyltransferase 
HID: A/-acetyl- 
glucosamine-6-sulfatase 


Absent PnTvnrifa iaofi\/tf\/ 
/~\u/oc?i It cii^yiitc aLrLIVily 


Mucopolysaccharidosis IV 
(Morquio syndrome) types A 
and B 


iVA: Galactose 6- 
sulfatase 
IVB: P-Galactosidase 


Absent enzyme activity 


Mucopolysaccharidosis VI 
(Maroteaux-Lamy syndrome) 


A/-acetyI-galactbsamine 
4-sulfatase 


Absent enzyme activity 


Mucopolysaccharidosis VII (Sly 
syndrome) 


3-glucuronidase 


Absent enzyme activity 
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Enzyme 


. Dysfunction 




i-ceii Qisease (ML-iI) 


N-acetylglucosaminyl-l- 
phosphotransferase 


Phosphorylation of many 
lysosomal enzymes 


pcnindier disease (a-/v-acetyl- 
galactosaminidase deficiency) 


a^AA-acetyl- 
galactosaminidase 


Deficient activity of a-A/- 
acetylgalactosaminidase 


a-Mannosiddsis 


a-D-mannosidase 


Deficient or unstable 
enzyme activity 


P-Mannosidos!s 


P-D-mannosidase 


Deficient enzyme activity 


Sialidosis 


d-neuraminidase 


Deficient enzyme activity 


Aspartylglucosaminuria 


Aspartyglucosaminidase 


Deficient enzyme activity 


Fucosidosis 


d-L-fucosidase 


Deficient enzvmf* aptivitv 


Wolnrian disease and cholesteryl 
ester storage disease 


Acid lipase 


Deficient enzyme activity 


Ceramidase deficiency (F^arber 
lipdgranulomatosis) 


Ceramidase 


Deficient enzvme activltv 


Niemann-Pick disease (NPD) 
types A and B (primary 
sphingomyelin storage) 


Sphingomyelinase 


Deficient 
sphingomyelinase activity 


Gaucher disease type 1 
(non ne uronopath ic) 


Glucocerebrosidase 


Decreased catalytic 
activity arid some 
instability of enzyme 
protein 


Globoid-cell leukodystrophy 
(Krabbe disease) 


Galactosylceramidase 


Absent enzyme activity 


Metachromatic leukodystrophy 


Arylsulfatase A 


Deficient enzyme activity 


Fabry disease 


a-Galactosidase A 


Nonfunctional or unstable 
enzyme protein 


gangliosidosis 


Acid p-galactosidase 
(GLBI) 


Deficient enzyme activity 


gangliosidosis: 
hexosaminidase a-subunit 
deficiency (variant B, Tay-Sachs 
disease) 


P-hexosaminidase 


Absent or defectivie 
hexosaminidase A (ap) 
activity 


Steroid 21 -hydroxylase 
deficiency salt-losing form 


Steroid 21 -hydroxylase 


Absent or truncated 
enzyme vyith no activity 
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Steroid 5a-reductase 2 
deficiency 


Steroid Sot-red urta^f* P 


rMJoeni or unsiaDie 
enzyme activity 


Steroid sulfatase deficiency (X- 
linked ichthyosis) 


< SB-hvdroYv^tproiH 
sulfatase 


MDseni immunoreactive 
and enzymatically active 
protein (both deletion and 
nondeletion patients) 


Methylenetetrahydrofolate 
reductase deficiency 


Methylenetetrahydro- 
folate reductase 


Absent or deficient 
en^rme activity. 
Thermolabile variants 
have been described. 


Holocarboxylase synthetase 
deficiency 


n u 1 oca ID oxy las e 
synthetase 


Deficient holocarboxylas 
synthetase activity 


Bibtinidase deficiency 


Dioiiniaas& 


Deficient biotinidase 
activity 


Hereditary nnethemoglobihemia 

secondary to cytochrome bg 
reductase deficiency, types I, II, 

^rh^ III 

ana ni 


. v-^y locnrome Dg reductase 


Deficient enzyrhe activity 
in erythrocyte cytosdl only 
(type 1), in all tissues 
(type II), and in all 
hematopoetic cells (type 
III) 


Pyruvate kinase deficiency 
hemolytic anemia 


Pyruvate kinase 


Deficient enzyme activity 


Hexokinase deficiency 
hemolytic anemia 


Hexokinase 


Deficient enzyme activity 


v^iuouocpi luopjiaie isomerase.. 
deficiency hemolytic anemia 


Glucosephosphate 
isomerase 


Deficient enzyme activity 


Aldolase deficiency hemolytic 
anemia 


Aldolase (A tvoeV 


r^ofioionf ^ri-TXiry^A <«<^4-n#!^« 

L/diuiciu enzyme acuvicy 


Triosephosphate ispmerase 
deficiency hemolytic anemia 


Triosephosphate 
isomerase 


Enzyme activity deficient 
in all tissues 


Phosphoglycerate kinase 
deficiency hemolytic anemia 


Phosphoglycerate kinase 


Deficient enzyme activity 
in hemizygotes 


2,3-Diphospho-glyceromutase 
and phosphatasie deficiency 


2,3-Diphospho-glycerate- 
mutase and phosphatase 
(1 protein) 


Deficient enzyme activity 


6-Phosphoglucphate 
dehydrogenase deficiency 


6-Phosphogluconate 
dehydrogenase 


Enzyme activity 
deficiency 
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Dysfunction 




Glutathione peroxidase 
deficiency 


Glutathione peroxidase 


Diminished enzyme 
activity 


Glutathione reductase 
deficiency 


Glutathione reductase 


Deficient enzyme activity 


Glutathione synthetase 
deficiency hemolytic anemia 


Glutathione synthetase 


Deficient enzyme activity 


y-Glutamylcysteine synthetase 
deficiency hemolytic anemia 


Y-C3lutamylcysteine 
synthetase 


Deficient enzyme activity 


nvici iwoii 16 vjcdi 1 III idoc . 

hyperactivity hemolytic anemia 


Adenosine deaminase 


Overproduction of 
structurally normal 
enzyme protein mediated 
at mRNA translation level 


Pyrimidine nucleotidase 
ueiiciency nemoiyiic anemia 


Pyrimidine nucleotidase 


Deficient enzyme activity 


Myeloperoxidase deficiency 


Myeloperoxidase 


Absent or deficient 
enzyme activity 


Carbonic anhydrase Jl 
deficiency syndrome 
(osteopetrosis with renal tubular 
acidosis) 


Carbonic anhydrase II 


Quantitative deficiency of 
carbonic anhydrase II 


Albinism, oculocutaneous 
tyrosinase-negative type 
(OCAIA) 


Tyrosinase 


Absent, reduced, or 
unusual enzyme activity 


Canavan disease 


Aspartoacylase 


Deficient enzyme activity 
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Enzyme 


Substrate 




Product 


tntemal. 














standard 






(M + H)* 


(M + Na)* 


(M + H)* 


(M + Nar 


(M + H)* 


(M + Na)* 


Type la;b 


711 


733 


549 


571 


555 


577 


Manose- 
transferase 


725 


.747 


563 


585 


570 


592 


Type II 


1156 


1178 


1343 


1365 


1348 


1370 


Type 


1126 


1148 


2362^ 


2384^ 


2367* 


2389» 



(Mannose-6lcNAc)2 residue. 
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Scheme 1 



A. 



HN NH 




LINKER (heavy has 8 deuteriums) 



FUNCTIONAL GROUP 



Flowchart illustration of the procedure for quantitative protein profile 
measurements based on stable isotope ratios by mass spectrometry 



Sample protein mixture 



Reference Protein mixture 



Derivatize cysteines 
with light isotope reagent 



Derivatize cysteines 
with heavy isotope reagent 



Enzymatically cleave protein 



Affinity isolation of derivatized peptides 



Analysis and quantitation of derivatized peptides by LC-MS/MS 
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Scheme 4 




Enzyme 





Enzyme 



Linker 



Schemes 

Standardfs'^II^d ''^^''^^^^^^ Conjugates 1 and 2, Product Conjugates 3 and 4, and Intemal 




o 

T 

HN NH 



1 

• =0 = CH2 OH 

n = 1 

^OH 

n = 0 



3 R, = H 
n - 1 



4 Ri = H 

• =0 = CH2 



O ^ ~^"2 n = 0 



5 Ri = H 

• =0-CD2 
n = 1 

6 Ri = H 

= CH2 O = CD2 
n = 0 



77 



SUBSTITUTE SHEET (RULE 26) 



wo 00/11208 



PCT/US99/19415 




SUBSTITUTE SHEET (RULE 26) 



wo 00/11208 



PCT/US99/19415 



Scheme 7 




NH Me 




HO 




(CH2)i2d^3 



GalactosyU 
Cerebroside 



OH 



Glucose 



ATP 



Scheme 8 



Glucose-6-P 



Mamiose 



ATP 



Dol-P-mannose 



Dol = Dolichol 




7^ 

GDP Dol-P 



GDP-mannose 



GIT 



Fructose-6-P 

i 

PMlb 

Mannose-6-P 

i 

PMM2 

Mannose-l-P 
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Scheme 9 




Scheme 10 
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Scheme 1 1 
O 




BF3, EtjO 
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Scheme 12 (continued) 

PAC 




NH2-Try-Gln-Ser-Asn-Ser-Thr-Met-NH2 



B-S-NH-Try-Gln-Ser-Asn-Ser-Thr-Met-NH2 
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Scheme 14 




5 
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Scheme 16 
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Scheme 17 




15, X = OH 

16, x = ei 

17, X = CN 

18, X = CH2NH2 
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°-<Q 



NHAC 



19, X = NH2 20, X= 



O 
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21, X=GN 

22, X = CH2NH2 

25, x = CN 

26, X = CH2NH2 
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Scheme 20 
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Scheme 22 




NH 



36, X=OH O 



37, X- o— N 




H— (CH2)ii 




P 
X 



38, x = 



OH 



HO 




OH 



(CH2)i2 CH3 




(CH2)-i2 'CH3 
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We elaim: 



. A method for identifying one or more proteins or protein functions in one or more 
samples containing mixtures of proteins whicli comprises ttie steps of: 

(a) providing an affinity tagged, substantially chemically identical and 

differentially isotopically labeled protein reactive reagent for each sample 
wherein the reagent has the formula: 

A-L-PRG 

where A is an affinity label that selectively binds to a capture reagent, L is a linker 
group in which one or more atoms can be differentially labelled with one or more 
stable isotopes and PRG is a protein reactive group that selectively reacts with certain 
protein functional groups or is a substrate for an enzyme; 

(b) reacting each sample with one of the protein reactive reagents to provide affinity 
tagged proteins or affinity tagged enzyme products in the sample, affinity tagged 
proteins and enzyme products in different samples being thereby differentially labeled 
with stable isotopes: 

(c) capturing affinity tagged components of the samples using the capture reagent 
that selectively binds A; 

(d) releasing captured affinity tagged components from the capture reagent by 
disnjpting the interaction between the affinity tagged components and the capture 
reagent; and 

(e) detecting and identifying the released affinity tagged components by mass 
spectrometry. 

The method of claim 1 wherein the affinity tagged proteins in the samples are 
enzymatically or chemically processed before or after their capture to convert them 
into affinity tagged peptides. 

The method of claim 2 wherein the protein portion of one or more of the affinity tagged 
pnateins are sequenced by tandem mass spectrometry to identify the affinity tagged 
protein from which the peptide originated; 

The method of claim 1 wherein the protein portion of one or more of the affinity tagged 
pnateins are sequenced by tandem mass spectrometry to identify the protein. 

The method of claim 1 in which the amount of one or more proteins in the samples is 
also determined by mass spectrometry and which further comprises the step of 
introducing into a sample a known amount of one or more intemal standards for each 
of the proteins to be quantitated. 

The method of claim 1 wherein PRG is an enzyme substrate and the enzymatic 
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velocities of one or more enzymes in a sample are determined by quantitation of 
affinity tagged enzyme products and which further comprises the step of introducing 
into a sample a known amount of one or more internal standards for each of the 
affinity tagged enzyme products of an enzyme, the velocity of which is to be 
quantitated. ' 

7. The method of claim 1 wherein the released affinity tagged components are 
separated by chromatography prior to detecting and identifying the components bv 
mass spectrometry. K»'=«us,uy 

8. The method of claim 1 in which a plurality of proteins or protein functions in one 
sample are detected and identified. 

9. The method of claim 1 further comprising a step in which one or more of the proteins 
m a sample are chemically or enzymatically processed to expose a functional qrouD 
that can react with an affinity tag. » k 

I d. The method of cjaim 9 wherein the PRG is an enzyme substrate for one or more 
enzymes, the deficiencies of which, are linked to a disease state. 

11. The method of claim 9 wherein an affinity tagged, substantially chemically identical 
and differentially isotopicaliy labeled enzyme substrate is provided foreaCh enzyme 
that is to be detected and identified in a sample. 

The method of claim 1 wherein PRG is a protein reactive group that selectively reacts 
with certain protein functional groups and a plurality of proteins are detected and 
Identified in a single sample. 

The method of claim 1 1 wherein two or more affinity tagged, substantially chemically 
Identical and differentially isotopicaliy labeled protein reactive reagents having 
different specificities for reaction with proteins are provided and reacted with each 
sample to be analyzed. 

1 4. The niethod of claim 1 3 wherein .all of the proteins in a sample are detected and 
identified. 

15. The method of claim 1 wherein the relative amounts of one or more proteins in two or 
more different samples are determined and which further comprises the steps of 
combining the differentially labeled samples, capturing affinity tagged components 
from the combined samples and measuring the relative abundances of the affinity 
tagged differentially labeled proteins or the affinity tagged differentially labeled 
peptides originating from that protein. 

16. The method of claim 1 5 which determines the relative amounts of mfembrane proteins 
in oi^e or more different samples. 

1 7. The method of claim 1 5 in which different samples contain proteins originating from 
different organelles or different subcellular fractions. 
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The method of claim 15 in which different samples represent proteins expressed in 
response to different environmental or nutritional conditions, different chemical or 
physical stimuli! or at different times. 

A method for determining relative expression levels of proteins in two or more 
samples.containing proteins which comprises the steps of: 

(a) providing an affinity tagged, substantially chemically identical and differentially 
isotopically labeled protein reactive reagent for each sample wherein the reaaent has 
the formula: ' 

A-L-PRG 

where A is an affinity label that selectively binds to a capture reagent. L is a linker 
group which can be differentially labelled with stable isotopes and PRG is a protein 
reactive group that selectively reacts with certain protein functional groups: 

(b) reacting each sample with one of the protein reactive reagents to provide affinity 
tagged proteins in the sample, affinity tagged proteins in different samples, being 
thereby differentially labeled with stable isotopes; 

(c) combining the differentially labeled samples and treating the combined sample to 
cleave the proteins therein and to generate peptides; 

(d) capturing affinity tagged differentially labeled peptides Of the combined sample 
using the capture reagent that selectively binds A; 

(e) releasing captured affinity tagged differentially labeled peptides from the capture 
reagent by disrupting the interaction between the affinity tagged peptides and the 
capture reagent; and 

(e) detecting and identifying the released affinity tagged differentially labeled peptides 
by mass spectrometry; and 

(0 measuring the relative abundances of the isotopically distinct ion peaks generated 
from an affinity tagged differential labeled peptide to determining relative expression 
levels of the protein from which the affinity tagged differential labeled peptide 
originated. 

A reagent for mass spectrometric analysis of proteins which has the general 
formula: 

A-L-PRG 

where A is an affinity label that selectively binds to a capture reagent, L is a linker 
group which can be differentially labelled with stable isotopes and PRG is a protein 
reactive group that selectively that selectively reacts with certain protein functional 
groups. 
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The reagent of claim 20 wherein PRG is a sulfhydryl reactive group or an amine 
reactive group. 

The reagent of claim 20 wherein PRG is an enzyme substrate. 

The reagent of claim 20 wherein the A-L-PRG is soluble in a sample liquid to be 
analyzed. . 

The reagent of claim 20 wherein the linker is a cleavable linker. 
The reagent of claim 20 which has the general formula: 

A-B1.X1-{CH^)n.[X2.{CH2)m ]x.X3^CH2)p .X4.B2.PRG 

where: A is an affinity label; 

PRG is a protein reactive group; and 

B1-X1-(CH2)n-[X2-(CH2)m ]x-X3-(CH2)p -X4-B2 is a linker group wherein: 

X1 , X2. X3 and X4, independently of one another, and X2 independently of other X2 

rcTMl-t^o^"^ ?' ^OO. COS. S-S. SO. S02. CO-NR'.' 

CS-NR . Si-O. aryl or diaryl groups or X1-X4 may be absent; 

°^ one another, are optional groups selected from COO 
CO. CO-NR'. CS-NR'. ( CH2 )q-CONR'. (CH2)q-CS-NR\ or (CH2)q; ' 

n m. p. q and x are whole numbers that can take values from 0 to about 100 where 
thesumof n+xm+p+q is less than about 100; 

R is an alkyl. alkenyl. alkynyl, alkoxy or an aryl group that is optionally substituted with 
one or more alkyl. alkenyl. alkynyl, or alkoxy groups; and 

R' is a hydrogen, ah alkyl. alkenyl. ajkynyl. alkoxy or an aryl group that is optionally 
substituted with one or more alkyl. alkenyl. alkynyl. or alkoxy groups 

wherein one or more of the CH2 groups in the linker can be optionally substituted with 
aikyi. a kenyl alkoxy groups, an aryl group that is optionally substituted with one or 
more alkyl. alkenyl. alkynyl. or alkoxy groups, an acidic group, a basic group or a 
group canying a permanent positive or negative charge; wherein one or more single 
bonds linking non-adjacent CH2 groups in the linker can be replaced with a double or 
a tnple bond and wherein one or more of the atoms in the linker can be substituted 
with a stable isotope. 

The reagent of claim 20 wherein the affinity label is biotin or a modified blotin. 

The reagent of claim 20 wherein the affinity label is selected from the group 
consisting of a 1 .2-diol. glutathione, maltose, a nitrilotriacetic acid group; or an 
oligohistidine. 
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28. The reagent of claim 20 wherein the affinity label is a hapten. 

29. The reagent of claim 20 wherein PRG is a sulfhydryl-reactive group. 

30. The reagent of claim 20 wherein PRG is an iodoacetylamide group, an epoxide, an. a- 
haloacyl group, a nitriles, a sulfonated alkyi, an aryl thiols or a maleimide. 

31 . The reagent of claim 20 wherein PRC is an amine reactive group, a group that 
reacts with a homoserine lactone or a group that reacts with carboxylic acid group. 

32. The reagent of claim 20 wherein PRG is selected from the groups consisting of a 
amine reactive pentafluorophenyl ester group, an amine reactive N-hydroxy 
succinimide ester group, sulfonyl halide, isocyanate. isothiocyanante. active ester, 
tertafluorophenyl ester, an acid halide, and an acid anyhydnde; a homoserine lactone 
reactive primary amine group, and an carboxylic acid reactive amine, alcohols or 
2,3,5.6-tetrafluorophenyl trifluoroacetate. 

33. The reagent of claim 20 wherein PRG is a substrate for an enzyme, 

34. The reagent of claim 20 wherein PRG is a substrate for an enzyme the deficiency of 
which is associated with a birth defect. 

35. The reagent of claim 20 wherein PRG is a substrate for an enzyme the deficiency 
of which is associated with a lysosomal storage disease. 

37. The reagent of claim 20 wherein PRG is a substrate for P-galactosidase, 
acetyl-a-D-glucosaminidase. heparan sulfamidase. acetyl-CoA-a- D-glucosaminide 
N-acetyltransferase or N-acetylglucosamine-6-sulfatase. 

38. The reagent of claim 20 wherein at least one of B1 or B2 is CO-NR' or CS-NR. 

39. The reagent of claim 20 wherein X1 and X4 are selected from NH, NR. and 
NRR'+. X3 is O and all X2 groups are O. 

40. The reagent of claim 20 wherein the linker contains a disulfide group. 

41 . The reagent of claim 20 wherein any atom of the linker may be substituted with a 
heavy isotope. 

42. A reagent kit for the analysis of proteins by mass spectral analysis that comprises a 
reagent of claim 20. 

43. The reagent kit of claim 42 that comprises one or more reagents of claim 20. 

44. The reagent kit of claim 42 further comprising one or more proteolytic enzymes for 
use in digestion of affinity tagged proteins. 

45. The reagent kit of claim 42 which comprises a set of substantially chemically 
identical differentially labelled affinity tagged reagents. 
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46. The reagent kit of claim 42 wheriin the reagent is an affinity tagged enzyme 
substrate reagent. 

47. the reagent kit of claim 46 which comprises a set of substantially chemically 
identical differentially labeled affinity tagged enzyme substrates. 

48. The reagent kit of claim 47 further comprising a set of substantially chemically 
identical differentially labeled affinity tagged enzyme products. 
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Fig. 1A Fig. 1B 




Height Ratio (dO/dS) 



Standard curve generated with a cysteine-biotinylatad peptide and 
quantitation by stable isotope dilution. A) Zoom-scan from an ion-trap mass 
spectrometer showing a 4 amu isotope distribution for the (M+2HP ions of the 
peptide modified with the isotopically light (1457.9 u) and heavy (1461 8) 
biotlnylating reagents. The ratio (d0/d8) was 4.54. B) Curve generated from 
the analysis of isotope ratios from zoom-scans of 5 different concentrations of 
dO-labeled peptide measured in the presence of a known amoqnt of peptide 
labeled with the isotopically heavy reagent. 
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»/ ^-x- "lass spectrum of a cysteine-modified peptide from a-lactalbumin 

Modification of the cysteine residue with the custom synthesized biotinylating reagent 
did not affect the ability of the Sequest computer program to correctly match this 
peptide to a database sequence. 



Fig. 2 



wo 00/11208 



3 / 8 



PCt/US99/19415 



Fig. 3A 



lOOi 564, 



lOOi 

s I 



c 

C 



568 



1360 



500 



600 



j 994 



998 



1184 

1176 : 



1376 



700 



800 



900 



1000 

m/z 



1100 1200 13O0 1400 1500 



Fig. 3B 



100-| 

S 8oJ 

c i 

-S i 

-o - . • 
< 

Q 40 
» 20- 



997.7 

i 



Fig. 3C 



100-1 



Area: 6.16x10^ 



Ught 



993.8- 



Heavy . 



g 50-1 

■CO ; 

I 0^ 

< 100- 

CD . : 

> 

I 50 1 

cc i 




m/z^ 993.8 i: 0,5 



Area: 11.33 x 10^ 




m/fe 997.7 ± 0.5 



960 



990 1000 
m/z 



1010 



19.0 19.5 20.0 20.5 
Time (mln) 



r 



wo 00/11208 PGT/US99/19415 I 

4/8 



Fig. 4 A loon 



^ 75 



c 

CO 
T3 



NH^-V-P-T-P-N-V-S-V-V-D-L-T-C-R-cooH 



987 



1201 



< 50l 

.> ! 

25 



874 

C-hlCAT 



; r 11.02 




in.,!... 



! I 



V. 



1387 



1300 



V. 



v. 



600 



■ N. 



800 



I 

IjL 



I 



1000 



1200 
m/z 



-L- 



1400 



1600 



1800 



Fig. 4B 



s03 19_hlnavcid. 0364 . 03 54 . 2 . otic 

S amino acids = 93009on - — , 

^-wa - proreiM = 290043. 

C:M.CQ\darabase\owl.v31. 3, (C# +494 . 50) 



.Tiacched peptides =» 1973750 



Rank/Sp (M-*-K)- C*10*4 



Ions 



Rerers 



Peptide 



; r;:!-''-'-^ ^^''^^ A.29ir- 

5 /3c^ 195=,! ::...3. -3,33 ,._3,, 



( Y ) I.AI.LPS3AZGPKGCFVTDK 
( L ),ALLVI.VA?AilAAGNGSDLRN 



wo 00/11208 



5 / 8 



PCT/US99/19415 



Fig. 5A 



Glucose or Galactose 
2NAD" 



Oxidative Phosphorylation 

CO2+H2O 




Alcohol dehydrogenase isozymes 
2NADH 2NAD* 
DH 




Oxaloacetate 

r 

TCA Cycle' ^ 



2NADH 2NAD* 




CH3.CH2-OH 
t: Ethanol 



ADH1 : YSVC#HTDLHAWHGDWPLPyK 
100-. I 



(J 
C 
(0 

C 

J 50- 
< 

(D 
> 

i9 

cr 



1453.1 
J EthanoT^ \ 


^Galactose 







1430 1440 1450 1460 1470 1480 
m/z 



lOOn 



ADH2 c YSVC#HTDLHAWHGDWPLPIK 

1454;1. 



50- 



0- 







Ethanol^^ 


Galactose 







1430 1440 1450 1460 1470 1480 
m/z 



Ratio: 0.57 



Ratio: >200 



Fig. 5B 



Fig. 5C 



WQ 00/11208 



6 / 8 



PCT/US99/19415 



A) 1001 



i 



SFB Substncte 



aoj r««-i 
j 



E 60 

< ■ 

^ 40 



20 
0 

(B) 100 



SFB I.S. GM, I.S. 



GM, Substrate 



J. 



: 1(1 "I 



680 



700 



740 ^ 780 



820 



680 



1 60 i r^^n 

4 

c 1 



SFB Substrate GM, Substrate 



C) 



J 60 
< 

I 40 



20 

d 

10(9 



SFB Product GM, Product 



SFB I.S. 



GM, I.S. 



: 1 I i 



J 



660 



700 



c on 3 r*^n 

11 



740 780 820 

m/z 

SFB Substrate 



rx3-, 



= 60 I 



• f SFB Product 
5 : \ SFB I.S. GM, I.S. 



20 
0 



860 



GMi Substrate 



J 



Figure 



660 700 740 ^ 780 820 860 

m/z 



wo 00/11208 



7 / 



8 



PCT/US99/19415 




0 



Q CD 

r- S O «J 
5 i S ^ <^ t3 
"J o !;; <D c: 

cn CO -ii (ft 

il^li la 

<^ 5 o-c 
S ^ 5 S ^ m -5 

E ^ a o -s <i» 

To ci-S ^ f .12 

CO .52 "5 15 QJ a ^ 
9> § S 



I -| ^ -s* 



Cj> <u S- CO 
— > V 

a o s 2 J c ° 

CO 0) Qi CO Qj TO «) 

e"5 :£.<2 Qj ^-^ 

o </> ^ p QJ Q- 

^ ^ -e 13 a 

a i2 o o 

C O QJ qj 



CO 
03 



£ o CO 
CO "o E 

O C «) 



^ o a 



CO .9 

. ^ CO S ^ 

o S o CO .9 
^ CD CO it: 



■SCO ^ e s^g-'o 
05 ^ -.^ a Q. Q.^ 

qj Q eg - ' S> CO 




cvj CO 3: 0) CC to 



wo 00/11208 



8 / 



PCT/US99/19415 




4 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US99/19415 



A. CLASSIFICATION OF SUBJECT MATTER 
.IPG(6) :C\2.<J 1/00; COIN 33/573, 33/53, 33/567, 24/00. 
US CL :435/4, 7.2. 7.4, 7.5; 436/173, 811, 813. 
According to ihtemiationai Patent Classification (IPC) or to both national classification arid IPC 



FIELDS SEARCHED 



i^inimum documentation searched (classification system followed by classification symbols) 
U.S.: 435/4,7:2. 7.4. 7.5; 436/173, 811, 813. 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 
NONE 



EliBctronic data base consulted during the international search (name of data base and, where practicable, search teiros used) 
WEST 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Caiegoiy^ 



Y, P 



Y. E 



Citation of document, with indication, where appropriate, of the relevant passages 



us 5,880, 270 A (BERNINGER et al.) 09 March 1999, col. 3, line 
55-col. 4, line 31. 

US 5,614,368 A (GHAZAROSSIAN et al.) 25 March 1997, cbl. 3, 
line 36-col. 6, line 15. 

US 5,534,132 A (VREEKE et al.) 09 July 1996, col. 2, line 25- 
col.3, line 5. 

US 5,527,711 A (TOM-MOY et al.) 18 June 1996, col. 2, line 61- 
col. 3, line 40. 

US 5,958,703 A (DOWER et al.) 28 September 1999, col. 2. line 6- 
col. 4, line 39, 



Relevant to claim No. 



1-45 



1-45 



1-45 



1-45 



1-45 



Further documents arc listed in the continuation oif Box C. . | | See patent family annex. 



Special categories of cited documents: 

document deftntng the general state of tlie.art which b not considered 
to be of particular relevance 

earlier document published on or after the international filing date 

document which may throw doubts on priority claim (s) or which is 
cited to esubltsh the publication date of another citation or other 
special reason (as specified) 

doctiment referring to an oral disclosure, use, exhibition or other 
.means 

document publbhed prior to the international filing date but later than 
the priority date claimed 



later document published after the international filing daU or priori^ 
date and not in conflict with the application but cited to understand 
the principle or theory underlying the invention 

document of particular relevance; the claimed invention cannot be 
considered novel or cannot be considered to involve an inventive step 
when the document is taken alone 

document of particular relevance; the claimed invention cannot be 
considered to involve an inventive step when the document is 
combined with one or more other such documents, such combination 
being obvious to a person skilled in the art- 
document member of the same patent family 



Dale of the actual completion of, the international search 



20 OCTOBER 1999 



Date of mailing of the international search report 



1 9 NOV ms 



Name and mailing address of the ISA/US 
Commissioner of Patents and Trademarks 
BoxPCT 

Washington, D.C. 20231 
Facsimile No. (703) 305-3230 



Authorized officer 

PENSEE T. DO i/j 
Telephone No. (703) 308-0196 




INTERNATIONAL SEARCH REPORT 



International application No. 
PCTAJS99/194i5 



C| (Continuation). DOCUMENTS CONSIDERED TO BE RELEVANT 


Category* 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No. 


Y 


us 5,514,559 A (MARKERT-HAHN et al.) G7 May 1996, col. 1. 
line 22-col. 3, line 5 


1-45 


A 


US 5,658.725 A (SCHLrePER et al.) 19 August 1997, col. 4, line 
28-col. 7, line 5. 


1-45 


A,P 


US 5,863,740 A (KIENTSCH-ENGEL et al.) 26 January 1999, col. 
2, line 13-col. 6, line 49. 


1-45 


A 


US 4,798,795 A (SIGLER) 17 January 1989, col. 1, line 60-col. 2, 
line 32. 


1-45 


A.E 


US 5,965,457 A (MAGNANI) 12 October 1999, cbl. 2, lines 12- 
68. 


1-45 


A.E 


US 5,965,131 A (GRIFFITHS et al.) 12 October 1999, col. 3. line 
52-col. 11, line 41. 


1-45 


A, P 


US 5,851,781 A (ADAMCZYK et al.) 22 December 1998, col. 5, 
line 20-col. 17, line 44. 


1-45 


A 


US 5,738,984 A (SHOSEYOV) 14 April 1998, col. 4, line 36-col. 
9, line 11. 


1-45 



Form PCT/ISAy210 (continuation of second sheetXJuly 1992)* 



